Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 46059 |
| Missing cells | 23313 |
| Missing cells (%) | 3.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 52.2 MiB |
| Average record size in memory | 1.2 KiB |
Variable types
| CAT | 8 |
|---|---|
| NUM | 6 |
| BOOL | 2 |
is_retweet has constant value "46059" | Constant |
user_name has a high cardinality: 25553 distinct values | High cardinality |
user_location has a high cardinality: 9345 distinct values | High cardinality |
user_description has a high cardinality: 24494 distinct values | High cardinality |
user_created has a high cardinality: 25871 distinct values | High cardinality |
date has a high cardinality: 45622 distinct values | High cardinality |
text has a high cardinality: 46018 distinct values | High cardinality |
hashtags has a high cardinality: 16835 distinct values | High cardinality |
source has a high cardinality: 171 distinct values | High cardinality |
user_location has 10365 (22.5%) missing values | Missing |
user_description has 3090 (6.7%) missing values | Missing |
hashtags has 9816 (21.3%) missing values | Missing |
user_friends is highly skewed (γ1 = 37.72401569) | Skewed |
retweets is highly skewed (γ1 = 88.08390544) | Skewed |
favorites is highly skewed (γ1 = 66.48283436) | Skewed |
date is uniformly distributed | Uniform |
text is uniformly distributed | Uniform |
id has unique values | Unique |
user_favourites has 671 (1.5%) zeros | Zeros |
retweets has 30075 (65.3%) zeros | Zeros |
favorites has 19255 (41.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-16 01:26:18.473024 |
|---|---|
| Analysis finished | 2021-05-16 01:26:48.238165 |
| Duration | 29.77 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 46059 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.367027866e+18 |
|---|---|
| Minimum | 1.337727768e+18 |
| Maximum | 1.378952133e+18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 1.337727768e+18 |
|---|---|
| 5-th percentile | 1.346330701e+18 |
| Q1 | 1.363362996e+18 |
| median | 1.368339972e+18 |
| Q3 | 1.373482143e+18 |
| 95-th percentile | 1.37768748e+18 |
| Maximum | 1.378952133e+18 |
| Range | 4.122436498e+16 |
| Interquartile range (IQR) | 1.011914633e+16 |
Descriptive statistics
| Standard deviation | 9.076527862e+15 |
|---|---|
| Coefficient of variation (CV) | 0.006639607056 |
| Kurtosis | 1.402304167 |
| Mean | 1.367027866e+18 |
| Median Absolute Deviation (MAD) | 5.077735542e+15 |
| Skewness | -1.265038672 |
| Sum | 5.198936432e+18 |
| Variance | 8.238335802e+31 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1.364449897e+18 | 1 | < 0.1% | |
| 1.377326662e+18 | 1 | < 0.1% | |
| 1.378623397e+18 | 1 | < 0.1% | |
| 1.37294314e+18 | 1 | < 0.1% | |
| 1.344406587e+18 | 1 | < 0.1% | |
| 1.374362908e+18 | 1 | < 0.1% | |
| 1.366695895e+18 | 1 | < 0.1% | |
| 1.362913572e+18 | 1 | < 0.1% | |
| 1.37190737e+18 | 1 | < 0.1% | |
| 1.370372644e+18 | 1 | < 0.1% | |
| 1.365974589e+18 | 1 | < 0.1% | |
| 1.374436836e+18 | 1 | < 0.1% | |
| 1.373732669e+18 | 1 | < 0.1% | |
| 1.366220823e+18 | 1 | < 0.1% | |
| 1.365974535e+18 | 1 | < 0.1% | |
| 1.367100394e+18 | 1 | < 0.1% | |
| 1.36859012e+18 | 1 | < 0.1% | |
| 1.373767181e+18 | 1 | < 0.1% | |
| 1.358497844e+18 | 1 | < 0.1% | |
| 1.366245596e+18 | 1 | < 0.1% | |
| 1.369686437e+18 | 1 | < 0.1% | |
| 1.358480395e+18 | 1 | < 0.1% | |
| 1.370425425e+18 | 1 | < 0.1% | |
| 1.354223068e+18 | 1 | < 0.1% | |
| 1.369791286e+18 | 1 | < 0.1% | |
| Other values (46034) | 46034 | 99.9% |
| Value | Count | Frequency (%) | |
| 1.337727768e+18 | 1 | < 0.1% | |
| 1.337728702e+18 | 1 | < 0.1% | |
| 1.337732077e+18 | 1 | < 0.1% | |
| 1.337732996e+18 | 1 | < 0.1% | |
| 1.337733049e+18 | 1 | < 0.1% | |
| 1.337733857e+18 | 1 | < 0.1% | |
| 1.337733928e+18 | 1 | < 0.1% | |
| 1.33773407e+18 | 1 | < 0.1% | |
| 1.337735596e+18 | 1 | < 0.1% | |
| 1.337739608e+18 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.378952133e+18 | 1 | < 0.1% | |
| 1.37895209e+18 | 1 | < 0.1% | |
| 1.378949363e+18 | 1 | < 0.1% | |
| 1.378948927e+18 | 1 | < 0.1% | |
| 1.378946878e+18 | 1 | < 0.1% | |
| 1.378946025e+18 | 1 | < 0.1% | |
| 1.378945931e+18 | 1 | < 0.1% | |
| 1.378943663e+18 | 1 | < 0.1% | |
| 1.378941453e+18 | 1 | < 0.1% | |
| 1.378941317e+18 | 1 | < 0.1% |
| Distinct | 25553 |
|---|---|
| Distinct (%) | 55.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 360.0 KiB |
| Workout Solutions | 1026 |
|---|---|
| Sputnik | 284 |
| Xukki🌍 | 219 |
| China Economy | 184 |
| Sputnik V | 170 |
| Other values (25548) |
| Value | Count | Frequency (%) | |
| Workout Solutions | 1026 | 2.2% | |
| Sputnik | 284 | 0.6% | |
| Xukki🌍 | 219 | 0.5% | |
| China Economy | 184 | 0.4% | |
| Sputnik V | 170 | 0.4% | |
| ILKHA | 135 | 0.3% | |
| MaryRobotic | 132 | 0.3% | |
| William Owen | 126 | 0.3% | |
| Tradia Inc | 121 | 0.3% | |
| Shen Shiwei沈诗伟 | 119 | 0.3% | |
| People's Daily, China | 110 | 0.2% | |
| New Straits Times | 91 | 0.2% | |
| Brazil SFE | 90 | 0.2% | |
| ChineseEmbassyManila | 88 | 0.2% | |
| CGTN | 79 | 0.2% | |
| The Peninsula Qatar | 73 | 0.2% | |
| CCTV+ | 68 | 0.1% | |
| People's Daily app | 65 | 0.1% | |
| RT | 58 | 0.1% | |
| Tibetans | 56 | 0.1% | |
| ME | 55 | 0.1% | |
| China News 中国新闻网 | 55 | 0.1% | |
| RiverRising | 55 | 0.1% | |
| IANS Tweets | 55 | 0.1% | |
| @shalinisharma87 | 53 | 0.1% | |
| Other values (25528) | 42492 | 92.3% |
Frequencies of value counts
Unique
| Unique | 20054 ? |
|---|---|
| Unique (%) | 43.5% |
Histogram of lengths of the category
Length
| Max length | 50 |
|---|---|
| Median length | 13 |
| Mean length | 14.35502291 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 55991 | 8.5% | |
| 55588 | 8.4% | ||
| e | 46246 | 7.0% | |
| i | 40686 | 6.2% | |
| n | 36151 | 5.5% | |
| r | 32669 | 4.9% | |
| o | 30137 | 4.6% | |
| s | 24274 | 3.7% | |
| t | 24086 | 3.6% | |
| l | 22904 | 3.5% | |
| h | 19074 | 2.9% | |
| u | 15092 | 2.3% | |
| d | 12946 | 2.0% | |
| m | 11312 | 1.7% | |
| S | 11175 | 1.7% | |
| c | 11144 | 1.7% | |
| y | 9855 | 1.5% | |
| M | 8836 | 1.3% | |
| k | 8710 | 1.3% | |
| A | 8454 | 1.3% | |
| g | 7259 | 1.1% | |
| C | 7102 | 1.1% | |
| T | 6998 | 1.1% | |
| D | 6623 | 1.0% | |
| R | 6099 | 0.9% | |
| Other values (2838) | 141767 | 21.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 442794 | 67.0% | |
| Uppercase Letter | 117176 | 17.7% | |
| Space Separator | 55594 | 8.4% | |
| Other Symbol | 15002 | 2.3% | |
| Other Punctuation | 8903 | 1.3% | |
| Other Letter | 7412 | 1.1% | |
| Decimal Number | 4198 | 0.6% | |
| Nonspacing Mark | 2830 | 0.4% | |
| Format | 1303 | 0.2% | |
| Dash Punctuation | 1238 | 0.2% | |
| Spacing Mark | 1027 | 0.2% | |
| Close Punctuation | 801 | 0.1% | |
| Open Punctuation | 779 | 0.1% | |
| Connector Punctuation | 635 | 0.1% | |
| Math Symbol | 568 | 0.1% | |
| Modifier Symbol | 336 | 0.1% | |
| Final Punctuation | 169 | < 0.1% | |
| Modifier Letter | 107 | < 0.1% | |
| Initial Punctuation | 87 | < 0.1% | |
| Currency Symbol | 71 | < 0.1% | |
| Private Use | 52 | < 0.1% | |
| Enclosing Mark | 48 | < 0.1% | |
| Other Number | 48 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 11175 | 9.5% | |
| M | 8836 | 7.5% | |
| A | 8454 | 7.2% | |
| C | 7102 | 6.1% | |
| T | 6998 | 6.0% | |
| D | 6623 | 5.7% | |
| R | 6099 | 5.2% | |
| N | 5746 | 4.9% | |
| P | 5609 | 4.8% | |
| B | 5555 | 4.7% | |
| E | 4715 | 4.0% | |
| I | 4307 | 3.7% | |
| H | 4103 | 3.5% | |
| K | 3945 | 3.4% | |
| L | 3886 | 3.3% | |
| J | 3722 | 3.2% | |
| G | 3657 | 3.1% | |
| W | 3589 | 3.1% | |
| F | 3012 | 2.6% | |
| V | 2631 | 2.2% | |
| O | 2605 | 2.2% | |
| U | 1254 | 1.1% | |
| Y | 1021 | 0.9% | |
| Z | 584 | 0.5% | |
| X | 506 | 0.4% | |
| Other values (307) | 1442 | 1.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 55991 | 12.6% | |
| e | 46246 | 10.4% | |
| i | 40686 | 9.2% | |
| n | 36151 | 8.2% | |
| r | 32669 | 7.4% | |
| o | 30137 | 6.8% | |
| s | 24274 | 5.5% | |
| t | 24086 | 5.4% | |
| l | 22904 | 5.2% | |
| h | 19074 | 4.3% | |
| u | 15092 | 3.4% | |
| d | 12946 | 2.9% | |
| m | 11312 | 2.6% | |
| c | 11144 | 2.5% | |
| y | 9855 | 2.2% | |
| k | 8710 | 2.0% | |
| g | 7259 | 1.6% | |
| p | 5619 | 1.3% | |
| b | 5326 | 1.2% | |
| v | 5007 | 1.1% | |
| w | 4917 | 1.1% | |
| f | 3371 | 0.8% | |
| z | 2298 | 0.5% | |
| j | 1981 | 0.4% | |
| x | 1036 | 0.2% | |
| Other values (452) | 4703 | 1.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 55588 | > 99.9% | ||
| 6 | < 0.1% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| 🇺 | 734 | 4.9% | |
| 🇮 | 716 | 4.8% | |
| 🇳 | 696 | 4.6% | |
| 💙 | 559 | 3.7% | |
| 🇪 | 489 | 3.3% | |
| 🇸 | 403 | 2.7% | |
| 😷 | 393 | 2.6% | |
| 🌈 | 323 | 2.2% | |
| 🇬 | 320 | 2.1% | |
| 🇧 | 316 | 2.1% | |
| 🌍 | 282 | 1.9% | |
| 🇦 | 277 | 1.8% | |
| 🏳 | 256 | 1.7% | |
| 🇨 | 229 | 1.5% | |
| 🌊 | 207 | 1.4% | |
| 🇷 | 205 | 1.4% | |
| 🇰 | 182 | 1.2% | |
| 🏴 | 177 | 1.2% | |
| 🇵 | 168 | 1.1% | |
| 🇭 | 161 | 1.1% | |
| ™ | 154 | 1.0% | |
| 🇱 | 153 | 1.0% | |
| 🇲 | 127 | 0.8% | |
| 🇹 | 116 | 0.8% | |
| 🇿 | 113 | 0.8% | |
| Other values (862) | 7246 | 48.3% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 3471 | 39.0% | |
| # | 1542 | 17.3% | |
| , | 1309 | 14.7% | |
| ' | 521 | 5.9% | |
| / | 394 | 4.4% | |
| ! | 342 | 3.8% | |
| @ | 336 | 3.8% | |
| & | 265 | 3.0% | |
| : | 147 | 1.7% | |
| * | 137 | 1.5% | |
| % | 114 | 1.3% | |
| " | 96 | 1.1% | |
| • | 57 | 0.6% | |
| ? | 44 | 0.5% | |
| 。 | 27 | 0.3% | |
| § | 13 | 0.1% | |
| \ | 13 | 0.1% | |
| ・ | 9 | 0.1% | |
| 〽 | 9 | 0.1% | |
| ¡ | 9 | 0.1% | |
| † | 7 | 0.1% | |
| ; | 7 | 0.1% | |
| · | 5 | 0.1% | |
| ๏ | 4 | < 0.1% | |
| ، | 2 | < 0.1% | |
| Other values (16) | 23 | 0.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 756 | 18.0% | |
| 1 | 716 | 17.1% | |
| 0 | 598 | 14.2% | |
| 4 | 439 | 10.5% | |
| 7 | 371 | 8.8% | |
| 3 | 315 | 7.5% | |
| 9 | 301 | 7.2% | |
| 5 | 269 | 6.4% | |
| 8 | 253 | 6.0% | |
| 6 | 164 | 3.9% | |
| ໐ | 2 | < 0.1% | |
| ૮ | 2 | < 0.1% | |
| ૯ | 2 | < 0.1% | |
| ๓ | 1 | < 0.1% | |
| ० | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 0 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 𝟟 | 1 | < 0.1% | |
| 𝟝 | 1 | < 0.1% | |
| ੯ | 1 | < 0.1% | |
| ໒ | 1 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1226 | 99.0% | |
| – | 5 | 0.4% | |
| — | 4 | 0.3% | |
| 〰 | 2 | 0.2% | |
| ‑ | 1 | 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| | | 292 | 51.4% | |
| + | 124 | 21.8% | |
| ~ | 72 | 12.7% | |
| = | 31 | 5.5% | |
| ∴ | 18 | 3.2% | |
| ∂ | 7 | 1.2% | |
| ∆ | 4 | 0.7% | |
| ≋ | 4 | 0.7% | |
| ⧖ | 3 | 0.5% | |
| ⋆ | 2 | 0.4% | |
| ∞ | 2 | 0.4% | |
| ℘ | 2 | 0.4% | |
| ◻ | 2 | 0.4% | |
| ↔ | 1 | 0.2% | |
| ÷ | 1 | 0.2% | |
| ∀ | 1 | 0.2% | |
| ⊕ | 1 | 0.2% | |
| ≜ | 1 | 0.2% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 630 | 99.2% | |
| ‿ | 5 | 0.8% |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| ️ | 1387 | 49.0% | |
| ͟ | 260 | 9.2% | |
| ् | 212 | 7.5% | |
| ் | 98 | 3.5% | |
| ं | 87 | 3.1% | |
| ु | 72 | 2.5% | |
| े | 67 | 2.4% | |
| ︎ | 49 | 1.7% | |
| ू | 34 | 1.2% | |
| ್ | 30 | 1.1% | |
| َ | 27 | 1.0% | |
| ୍ | 21 | 0.7% | |
| ُ | 19 | 0.7% | |
| ّ | 19 | 0.7% | |
| ି | 18 | 0.6% | |
| ै | 14 | 0.5% | |
| ಿ | 11 | 0.4% | |
| ্ | 11 | 0.4% | |
| ِ | 10 | 0.4% | |
| ్ | 9 | 0.3% | |
| ੰ | 9 | 0.3% | |
| ْ | 8 | 0.3% | |
| ీ | 8 | 0.3% | |
| ̶ | 8 | 0.3% | |
| ̴ | 8 | 0.3% | |
| Other values (134) | 334 | 11.8% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| ا | 435 | 5.9% | |
| र | 263 | 3.5% | |
| ل | 207 | 2.8% | |
| ن | 206 | 2.8% | |
| م | 190 | 2.6% | |
| ی | 173 | 2.3% | |
| و | 160 | 2.2% | |
| ر | 146 | 2.0% | |
| ي | 140 | 1.9% | |
| स | 135 | 1.8% | |
| त | 129 | 1.7% | |
| म | 128 | 1.7% | |
| न | 126 | 1.7% | |
| क | 123 | 1.7% | |
| د | 121 | 1.6% | |
| 沈 | 119 | 1.6% | |
| 诗 | 119 | 1.6% | |
| 伟 | 119 | 1.6% | |
| व | 107 | 1.4% | |
| س | 104 | 1.4% | |
| ب | 103 | 1.4% | |
| प | 89 | 1.2% | |
| द | 86 | 1.2% | |
| य | 76 | 1.0% | |
| ع | 73 | 1.0% | |
| Other values (730) | 3735 | 50.4% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| 🏻 | 148 | 44.0% | |
| 🏾 | 37 | 11.0% | |
| 🏼 | 35 | 10.4% | |
| 🏽 | 31 | 9.2% | |
| ^ | 23 | 6.8% | |
| ¯ | 14 | 4.2% | |
| ` | 12 | 3.6% | |
| 🏿 | 11 | 3.3% | |
| ﮼ | 9 | 2.7% | |
| ꜀ | 4 | 1.2% | |
| ¸ | 4 | 1.2% | |
| ´ | 4 | 1.2% | |
| ˃ | 2 | 0.6% | |
| ˂ | 2 | 0.6% |
Most frequent Initial Punctuation characters
| Value | Count | Frequency (%) | |
| « | 42 | 48.3% | |
| “ | 36 | 41.4% | |
| ‘ | 9 | 10.3% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 85 | 50.3% | |
| ” | 42 | 24.9% | |
| » | 42 | 24.9% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| | 483 | 37.1% | |
| | 152 | 11.7% | |
| | 127 | 9.7% | |
| | 127 | 9.7% | |
| | 102 | 7.8% | |
| | 62 | 4.8% | |
| | 62 | 4.8% | |
| | 40 | 3.1% | |
| | 40 | 3.1% | |
| | 29 | 2.2% | |
| | 25 | 1.9% | |
| | 25 | 1.9% | |
| | 8 | 0.6% | |
| | 8 | 0.6% | |
| | 7 | 0.5% | |
| | 6 | 0.5% |
Most frequent Enclosing Mark characters
| Value | Count | Frequency (%) | |
| ⃣ | 35 | 72.9% | |
| ҉ | 9 | 18.8% | |
| ⃤ | 4 | 8.3% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 30 | 42.3% | |
| € | 12 | 16.9% | |
| ¤ | 8 | 11.3% | |
| ₹ | 7 | 9.9% | |
| ¥ | 4 | 5.6% | |
| ¢ | 3 | 4.2% | |
| ₿ | 2 | 2.8% | |
| £ | 2 | 2.8% | |
| ꠸ | 1 | 1.4% | |
| ₲ | 1 | 1.4% | |
| ₮ | 1 | 1.4% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ➐ | 16 | 33.3% | |
| ⁷ | 10 | 20.8% | |
| ² | 7 | 14.6% | |
| ⁴ | 2 | 4.2% | |
| ¹ | 2 | 4.2% | |
| ⑤ | 2 | 4.2% | |
| ³ | 2 | 4.2% | |
| ➂ | 1 | 2.1% | |
| ❾ | 1 | 2.1% | |
| ¾ | 1 | 2.1% | |
| ❼ | 1 | 2.1% | |
| ② | 1 | 2.1% | |
| ⑧ | 1 | 2.1% | |
| ⁸ | 1 | 2.1% |
Most frequent Private Use characters
| Value | Count | Frequency (%) | |
| | 22 | 42.3% | |
| | 10 | 19.2% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 743 | 95.4% | |
| [ | 23 | 3.0% | |
| { | 6 | 0.8% | |
| 「 | 2 | 0.3% | |
| ( | 1 | 0.1% | |
| 《 | 1 | 0.1% | |
| ⦅ | 1 | 0.1% | |
| ༺ | 1 | 0.1% | |
| ︵ | 1 | 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 762 | 95.1% | |
| ] | 24 | 3.0% | |
| } | 6 | 0.7% | |
| ) | 3 | 0.4% | |
| 」 | 2 | 0.2% | |
| 》 | 1 | 0.1% | |
| ⟆ | 1 | 0.1% | |
| ⦆ | 1 | 0.1% | |
| ༻ | 1 | 0.1% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ा | 288 | 28.0% | |
| ि | 194 | 18.9% | |
| ी | 118 | 11.5% | |
| ா | 58 | 5.6% | |
| ி | 56 | 5.5% | |
| ो | 48 | 4.7% | |
| ு | 36 | 3.5% | |
| া | 24 | 2.3% | |
| ौ | 16 | 1.6% | |
| ੀ | 15 | 1.5% | |
| ਿ | 15 | 1.5% | |
| ಾ | 12 | 1.2% | |
| ெ | 11 | 1.1% | |
| ি | 9 | 0.9% | |
| ে | 7 | 0.7% | |
| ೀ | 7 | 0.7% | |
| ು | 7 | 0.7% | |
| ਾ | 7 | 0.7% | |
| ை | 7 | 0.7% | |
| ା | 7 | 0.7% | |
| ు | 7 | 0.7% | |
| ே | 6 | 0.6% | |
| ॉ | 6 | 0.6% | |
| ಂ | 5 | 0.5% | |
| ા | 5 | 0.5% | |
| Other values (27) | 56 | 5.5% |
Most frequent Modifier Letter characters
| Value | Count | Frequency (%) | |
| ᵃ | 20 | 18.7% | |
| ゚ | 9 | 8.4% | |
| ᵒ | 7 | 6.5% | |
| ᵉ | 7 | 6.5% | |
| ʸ | 6 | 5.6% | |
| ᵗ | 6 | 5.6% | |
| ʷ | 6 | 5.6% | |
| ᴾ | 5 | 4.7% | |
| ᴴ | 5 | 4.7% | |
| ᴮ | 4 | 3.7% | |
| ᴱ | 4 | 3.7% | |
| ー | 4 | 3.7% | |
| ʳ | 4 | 3.7% | |
| ᵈ | 4 | 3.7% | |
| ـ | 3 | 2.8% | |
| ˡ | 3 | 2.8% | |
| ᵍ | 3 | 2.8% | |
| ʰ | 2 | 1.9% | |
| ᵐ | 2 | 1.9% | |
| ʻ | 1 | 0.9% | |
| ᵏ | 1 | 0.9% | |
| ˢ | 1 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 556091 | 84.1% | |
| Common | 92146 | 13.9% | |
| Devanagari | 3072 | 0.5% | |
| Arabic | 2692 | 0.4% | |
| Inherited | 2517 | 0.4% | |
| Han | 1169 | 0.2% | |
| Cyrillic | 715 | 0.1% | |
| Tamil | 619 | 0.1% | |
| Greek | 219 | < 0.1% | |
| Kannada | 213 | < 0.1% | |
| Katakana | 180 | < 0.1% | |
| Oriya | 175 | < 0.1% | |
| Bengali | 172 | < 0.1% | |
| Gurmukhi | 150 | < 0.1% | |
| Hebrew | 123 | < 0.1% | |
| Thai | 118 | < 0.1% | |
| Telugu | 108 | < 0.1% | |
| Canadian_Aboriginal | 75 | < 0.1% | |
| Hangul | 73 | < 0.1% | |
| Gujarati | 61 | < 0.1% | |
| Unknown | 52 | < 0.1% | |
| Armenian | 44 | < 0.1% | |
| Malayalam | 43 | < 0.1% | |
| Ethiopic | 40 | < 0.1% | |
| Hiragana | 39 | < 0.1% | |
| Other values (29) | 272 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 55991 | 10.1% | |
| e | 46246 | 8.3% | |
| i | 40686 | 7.3% | |
| n | 36151 | 6.5% | |
| r | 32669 | 5.9% | |
| o | 30137 | 5.4% | |
| s | 24274 | 4.4% | |
| t | 24086 | 4.3% | |
| l | 22904 | 4.1% | |
| h | 19074 | 3.4% | |
| u | 15092 | 2.7% | |
| d | 12946 | 2.3% | |
| m | 11312 | 2.0% | |
| S | 11175 | 2.0% | |
| c | 11144 | 2.0% | |
| y | 9855 | 1.8% | |
| M | 8836 | 1.6% | |
| k | 8710 | 1.6% | |
| A | 8454 | 1.5% | |
| g | 7259 | 1.3% | |
| C | 7102 | 1.3% | |
| T | 6998 | 1.3% | |
| D | 6623 | 1.2% | |
| R | 6099 | 1.1% | |
| N | 5746 | 1.0% | |
| Other values (243) | 86522 | 15.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 55588 | 60.3% | ||
| . | 3471 | 3.8% | |
| # | 1542 | 1.7% | |
| , | 1309 | 1.4% | |
| - | 1226 | 1.3% | |
| ) | 762 | 0.8% | |
| 2 | 756 | 0.8% | |
| ( | 743 | 0.8% | |
| 🇺 | 734 | 0.8% | |
| 1 | 716 | 0.8% | |
| 🇮 | 716 | 0.8% | |
| 🇳 | 696 | 0.8% | |
| _ | 630 | 0.7% | |
| 0 | 598 | 0.6% | |
| 💙 | 559 | 0.6% | |
| ' | 521 | 0.6% | |
| 🇪 | 489 | 0.5% | |
| 4 | 439 | 0.5% | |
| 🇸 | 403 | 0.4% | |
| / | 394 | 0.4% | |
| 😷 | 393 | 0.4% | |
| 7 | 371 | 0.4% | |
| ! | 342 | 0.4% | |
| @ | 336 | 0.4% | |
| 🌈 | 323 | 0.4% | |
| Other values (1424) | 18089 | 19.6% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ️ | 1387 | 55.1% | |
| | 483 | 19.2% | |
| ͟ | 260 | 10.3% | |
| ︎ | 49 | 1.9% | |
| ⃣ | 35 | 1.4% | |
| َ | 27 | 1.1% | |
| ُ | 19 | 0.8% | |
| ّ | 19 | 0.8% | |
| ِ | 10 | 0.4% | |
| | 8 | 0.3% | |
| ْ | 8 | 0.3% | |
| ̶ | 8 | 0.3% | |
| ̴ | 8 | 0.3% | |
| ̵ | 7 | 0.3% | |
| ̞ | 5 | 0.2% | |
| ̄ | 5 | 0.2% | |
| ͡ | 5 | 0.2% | |
| ͂ | 5 | 0.2% | |
| ͊ | 5 | 0.2% | |
| ͑ | 5 | 0.2% | |
| ̭ | 5 | 0.2% | |
| ̙ | 5 | 0.2% | |
| ́ | 4 | 0.2% | |
| ͜ | 4 | 0.2% | |
| ͛ | 4 | 0.2% | |
| Other values (68) | 137 | 5.4% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 435 | 16.2% | |
| ل | 207 | 7.7% | |
| ن | 206 | 7.7% | |
| م | 190 | 7.1% | |
| ی | 173 | 6.4% | |
| و | 160 | 5.9% | |
| ر | 146 | 5.4% | |
| ي | 140 | 5.2% | |
| د | 121 | 4.5% | |
| س | 104 | 3.9% | |
| ب | 103 | 3.8% | |
| ع | 73 | 2.7% | |
| ح | 60 | 2.2% | |
| ج | 55 | 2.0% | |
| ش | 40 | 1.5% | |
| ز | 38 | 1.4% | |
| ف | 37 | 1.4% | |
| ه | 36 | 1.3% | |
| ت | 36 | 1.3% | |
| ٹ | 35 | 1.3% | |
| غ | 34 | 1.3% | |
| ق | 30 | 1.1% | |
| ة | 29 | 1.1% | |
| ہ | 25 | 0.9% | |
| ص | 20 | 0.7% | |
| Other values (26) | 159 | 5.9% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 沈 | 119 | 10.2% | |
| 诗 | 119 | 10.2% | |
| 伟 | 119 | 10.2% | |
| 中 | 65 | 5.6% | |
| 新 | 57 | 4.9% | |
| 国 | 56 | 4.8% | |
| 闻 | 55 | 4.7% | |
| 网 | 55 | 4.7% | |
| 李 | 23 | 2.0% | |
| 碧 | 23 | 2.0% | |
| 建 | 23 | 2.0% | |
| 木 | 12 | 1.0% | |
| 根 | 11 | 0.9% | |
| 渕 | 11 | 0.9% | |
| 猛 | 11 | 0.9% | |
| 彡 | 8 | 0.7% | |
| 華 | 8 | 0.7% | |
| 美 | 8 | 0.7% | |
| 大 | 7 | 0.6% | |
| 南 | 7 | 0.6% | |
| 渡 | 7 | 0.6% | |
| 部 | 7 | 0.6% | |
| 智 | 7 | 0.6% | |
| 子 | 7 | 0.6% | |
| 蘋 | 6 | 0.5% | |
| Other values (183) | 338 | 28.9% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ್ | 30 | 14.1% | |
| ರ | 20 | 9.4% | |
| ನ | 12 | 5.6% | |
| ಾ | 12 | 5.6% | |
| ಿ | 11 | 5.2% | |
| ವ | 11 | 5.2% | |
| ಶ | 8 | 3.8% | |
| ಮ | 8 | 3.8% | |
| ೆ | 7 | 3.3% | |
| ೀ | 7 | 3.3% | |
| ು | 7 | 3.3% | |
| ಲ | 6 | 2.8% | |
| ಬ | 5 | 2.3% | |
| ಂ | 5 | 2.3% | |
| ಗ | 5 | 2.3% | |
| ಕ | 5 | 2.3% | |
| ತ | 4 | 1.9% | |
| ಸ | 4 | 1.9% | |
| ಜ | 4 | 1.9% | |
| ಯ | 4 | 1.9% | |
| ೇ | 4 | 1.9% | |
| ಹ | 4 | 1.9% | |
| ಳ | 3 | 1.4% | |
| ೂ | 3 | 1.4% | |
| ೈ | 3 | 1.4% | |
| Other values (13) | 21 | 9.9% |
Most frequent Unknown characters
| Value | Count | Frequency (%) | |
| | 22 | 42.3% | |
| | 10 | 19.2% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| י | 15 | 12.2% | |
| א | 12 | 9.8% | |
| ל | 10 | 8.1% | |
| ר | 10 | 8.1% | |
| פ | 8 | 6.5% | |
| ס | 8 | 6.5% | |
| ב | 8 | 6.5% | |
| ה | 6 | 4.9% | |
| נ | 6 | 4.9% | |
| ד | 6 | 4.9% | |
| מ | 5 | 4.1% | |
| ש | 4 | 3.3% | |
| ן | 4 | 3.3% | |
| ת | 4 | 3.3% | |
| ָ | 3 | 2.4% | |
| ִ | 2 | 1.6% | |
| ע | 1 | 0.8% | |
| ט | 1 | 0.8% | |
| צ | 1 | 0.8% | |
| ק | 1 | 0.8% | |
| ץ | 1 | 0.8% | |
| ׁ | 1 | 0.8% | |
| ֵ | 1 | 0.8% | |
| ֲ | 1 | 0.8% | |
| ַ | 1 | 0.8% | |
| Other values (3) | 3 | 2.4% |
Most frequent Ethiopic characters
| Value | Count | Frequency (%) | |
| ን | 5 | 12.5% | |
| ድ | 5 | 12.5% | |
| ፌ | 4 | 10.0% | |
| ሊ | 4 | 10.0% | |
| ፔ | 4 | 10.0% | |
| አ | 4 | 10.0% | |
| ሬ | 4 | 10.0% | |
| ስ | 4 | 10.0% | |
| ር | 2 | 5.0% | |
| ሃ | 1 | 2.5% | |
| ኮ | 1 | 2.5% | |
| ሂ | 1 | 2.5% | |
| ዱ | 1 | 2.5% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ン | 25 | 13.9% | |
| ト | 12 | 6.7% | |
| モ | 11 | 6.1% | |
| ケ | 11 | 6.1% | |
| シ | 11 | 6.1% | |
| リ | 11 | 6.1% | |
| ヤ | 10 | 5.6% | |
| カ | 10 | 5.6% | |
| テ | 10 | 5.6% | |
| ウ | 9 | 5.0% | |
| ィ | 9 | 5.0% | |
| ツ | 7 | 3.9% | |
| ジ | 5 | 2.8% | |
| ミ | 4 | 2.2% | |
| マ | 4 | 2.2% | |
| エ | 3 | 1.7% | |
| イ | 3 | 1.7% | |
| ッ | 2 | 1.1% | |
| ク | 2 | 1.1% | |
| ハ | 2 | 1.1% | |
| ナ | 2 | 1.1% | |
| ヂ | 2 | 1.1% | |
| ド | 2 | 1.1% | |
| ョ | 2 | 1.1% | |
| ス | 1 | 0.6% | |
| Other values (10) | 10 | 5.6% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| и | 78 | 10.9% | |
| н | 69 | 9.7% | |
| т | 59 | 8.3% | |
| к | 47 | 6.6% | |
| у | 42 | 5.9% | |
| С | 35 | 4.9% | |
| п | 34 | 4.8% | |
| с | 33 | 4.6% | |
| о | 30 | 4.2% | |
| а | 25 | 3.5% | |
| в | 22 | 3.1% | |
| м | 20 | 2.8% | |
| е | 18 | 2.5% | |
| є | 16 | 2.2% | |
| л | 15 | 2.1% | |
| я | 14 | 2.0% | |
| р | 12 | 1.7% | |
| А | 11 | 1.5% | |
| ҉ | 9 | 1.3% | |
| ь | 9 | 1.3% | |
| ѕ | 9 | 1.3% | |
| Р | 8 | 1.1% | |
| д | 8 | 1.1% | |
| К | 7 | 1.0% | |
| П | 6 | 0.8% | |
| Other values (32) | 79 | 11.0% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 288 | 9.4% | |
| र | 263 | 8.6% | |
| ् | 212 | 6.9% | |
| ि | 194 | 6.3% | |
| स | 135 | 4.4% | |
| त | 129 | 4.2% | |
| म | 128 | 4.2% | |
| न | 126 | 4.1% | |
| क | 123 | 4.0% | |
| ी | 118 | 3.8% | |
| व | 107 | 3.5% | |
| प | 89 | 2.9% | |
| ं | 87 | 2.8% | |
| द | 86 | 2.8% | |
| य | 76 | 2.5% | |
| ु | 72 | 2.3% | |
| श | 71 | 2.3% | |
| ह | 68 | 2.2% | |
| े | 67 | 2.2% | |
| ज | 67 | 2.2% | |
| ल | 63 | 2.1% | |
| ो | 48 | 1.6% | |
| अ | 38 | 1.2% | |
| ग | 35 | 1.1% | |
| ू | 34 | 1.1% | |
| Other values (38) | 348 | 11.3% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| เ | 8 | 6.8% | |
| ร | 7 | 5.9% | |
| า | 7 | 5.9% | |
| น | 7 | 5.9% | |
| ม | 6 | 5.1% | |
| ก | 5 | 4.2% | |
| ย | 5 | 4.2% | |
| ล | 4 | 3.4% | |
| ๏ | 4 | 3.4% | |
| ี | 4 | 3.4% | |
| ่ | 4 | 3.4% | |
| ว | 4 | 3.4% | |
| ง | 3 | 2.5% | |
| ั | 3 | 2.5% | |
| ็ | 3 | 2.5% | |
| อ | 3 | 2.5% | |
| ้ | 3 | 2.5% | |
| ช | 3 | 2.5% | |
| ภ | 2 | 1.7% | |
| ุ | 2 | 1.7% | |
| ์ | 2 | 1.7% | |
| ท | 2 | 1.7% | |
| ส | 2 | 1.7% | |
| ด | 2 | 1.7% | |
| ต | 2 | 1.7% | |
| Other values (18) | 21 | 17.8% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| α | 50 | 22.8% | |
| ι | 18 | 8.2% | |
| η | 15 | 6.8% | |
| σ | 13 | 5.9% | |
| Λ | 10 | 4.6% | |
| ε | 8 | 3.7% | |
| ν | 8 | 3.7% | |
| τ | 7 | 3.2% | |
| Δ | 6 | 2.7% | |
| π | 5 | 2.3% | |
| υ | 5 | 2.3% | |
| κ | 5 | 2.3% | |
| ο | 5 | 2.3% | |
| Ξ | 4 | 1.8% | |
| ς | 4 | 1.8% | |
| Κ | 4 | 1.8% | |
| ω | 4 | 1.8% | |
| Β | 3 | 1.4% | |
| Ο | 3 | 1.4% | |
| β | 2 | 0.9% | |
| λ | 2 | 0.9% | |
| Ι | 2 | 0.9% | |
| Γ | 2 | 0.9% | |
| Ε | 2 | 0.9% | |
| Ν | 2 | 0.9% | |
| Other values (23) | 30 | 13.7% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ひ | 6 | 15.4% | |
| の | 3 | 7.7% | |
| し | 3 | 7.7% | |
| ま | 3 | 7.7% | |
| で | 2 | 5.1% | |
| き | 2 | 5.1% | |
| な | 2 | 5.1% | |
| い | 2 | 5.1% | |
| は | 2 | 5.1% | |
| す | 2 | 5.1% | |
| ね | 2 | 5.1% | |
| に | 2 | 5.1% | |
| り | 1 | 2.6% | |
| こ | 1 | 2.6% | |
| れ | 1 | 2.6% | |
| あ | 1 | 2.6% | |
| か | 1 | 2.6% | |
| げ | 1 | 2.6% | |
| た | 1 | 2.6% | |
| ん | 1 | 2.6% |
Most frequent Canadian_Aboriginal characters
| Value | Count | Frequency (%) | |
| ᗩ | 10 | 13.3% | |
| ᑎ | 8 | 10.7% | |
| ᖇ | 6 | 8.0% | |
| ᑭ | 6 | 8.0% | |
| ᒪ | 6 | 8.0% | |
| ᔕ | 5 | 6.7% | |
| ᕼ | 5 | 6.7% | |
| ᗪ | 5 | 6.7% | |
| ᒍ | 3 | 4.0% | |
| ᗰ | 3 | 4.0% | |
| ᑌ | 3 | 4.0% | |
| ᑕ | 2 | 2.7% | |
| ᘿ | 2 | 2.7% | |
| ᗯ | 2 | 2.7% | |
| ᒋ | 1 | 1.3% | |
| ᐯ | 1 | 1.3% | |
| ᕲ | 1 | 1.3% | |
| ᗴ | 1 | 1.3% | |
| ᖴ | 1 | 1.3% | |
| ᗞ | 1 | 1.3% | |
| ᗷ | 1 | 1.3% | |
| ᐠ | 1 | 1.3% | |
| ᐟ | 1 | 1.3% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| ղ | 14 | 31.8% | |
| օ | 5 | 11.4% | |
| ց | 4 | 9.1% | |
| հ | 3 | 6.8% | |
| ֆ | 3 | 6.8% | |
| յ | 2 | 4.5% | |
| ք | 2 | 4.5% | |
| Ե | 2 | 4.5% | |
| Տ | 2 | 4.5% | |
| Յ | 2 | 4.5% | |
| ժ | 1 | 2.3% | |
| Շ | 1 | 2.3% | |
| ա | 1 | 2.3% | |
| ե | 1 | 2.3% | |
| Թ | 1 | 2.3% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| ⲥ | 8 | 21.6% | |
| ⲓ | 8 | 21.6% | |
| Ⲃ | 4 | 10.8% | |
| ⲁ | 4 | 10.8% | |
| ⲗ | 4 | 10.8% | |
| ⲟ | 4 | 10.8% | |
| Ⲑ | 2 | 5.4% | |
| Ϣ | 1 | 2.7% | |
| Ϯ | 1 | 2.7% | |
| Ⲁ | 1 | 2.7% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠑ | 3 | 27.3% | |
| ⠁ | 2 | 18.2% | |
| ⠝ | 1 | 9.1% | |
| ⠃ | 1 | 9.1% | |
| ⠇ | 1 | 9.1% | |
| ⠓ | 1 | 9.1% | |
| ⠍ | 1 | 9.1% | |
| ⠙ | 1 | 9.1% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ੀ | 15 | 10.0% | |
| ਿ | 15 | 10.0% | |
| ਸ | 10 | 6.7% | |
| ਪ | 9 | 6.0% | |
| ੰ | 9 | 6.0% | |
| ਰ | 9 | 6.0% | |
| ਨ | 7 | 4.7% | |
| ਘ | 7 | 4.7% | |
| ਾ | 7 | 4.7% | |
| ਲ | 7 | 4.7% | |
| ਦ | 6 | 4.0% | |
| ਆ | 5 | 3.3% | |
| ਤ | 4 | 2.7% | |
| ਗ | 4 | 2.7% | |
| ਕ | 4 | 2.7% | |
| ਟ | 3 | 2.0% | |
| ਊ | 3 | 2.0% | |
| ਜ਼ | 3 | 2.0% | |
| ਬ | 3 | 2.0% | |
| ੌ | 3 | 2.0% | |
| ਂ | 3 | 2.0% | |
| ਡ | 3 | 2.0% | |
| ਵ | 2 | 1.3% | |
| ੁ | 2 | 1.3% | |
| ਮ | 2 | 1.3% | |
| Other values (5) | 5 | 3.3% |
Most frequent Linear_B characters
| Value | Count | Frequency (%) | |
| 𐂂 | 1 | 100.0% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| া | 24 | 14.0% | |
| ্ | 11 | 6.4% | |
| ব | 10 | 5.8% | |
| ল | 10 | 5.8% | |
| ি | 9 | 5.2% | |
| ে | 7 | 4.1% | |
| ম | 7 | 4.1% | |
| ন | 7 | 4.1% | |
| ৰ | 6 | 3.5% | |
| ত | 6 | 3.5% | |
| য | 5 | 2.9% | |
| জ | 5 | 2.9% | |
| প | 5 | 2.9% | |
| ক | 5 | 2.9% | |
| দ | 4 | 2.3% | |
| স | 4 | 2.3% | |
| ী | 4 | 2.3% | |
| র | 4 | 2.3% | |
| হ | 3 | 1.7% | |
| য় | 3 | 1.7% | |
| ণ | 3 | 1.7% | |
| ু | 3 | 1.7% | |
| গ | 3 | 1.7% | |
| ো | 2 | 1.2% | |
| শ | 2 | 1.2% | |
| Other values (16) | 20 | 11.6% |
Most frequent Egyptian_Hieroglyphs characters
| Value | Count | Frequency (%) | |
| 𓃬 | 2 | 16.7% | |
| 𓆉 | 2 | 16.7% | |
| 𓆩 | 1 | 8.3% | |
| 𓆪 | 1 | 8.3% | |
| 𓊈 | 1 | 8.3% | |
| 𓊉 | 1 | 8.3% | |
| 𓆏 | 1 | 8.3% | |
| 𓅪 | 1 | 8.3% | |
| 𓂆 | 1 | 8.3% | |
| 𓆌 | 1 | 8.3% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 4 | 5.5% | |
| 레 | 4 | 5.5% | |
| 임 | 4 | 5.5% | |
| 택 | 4 | 5.5% | |
| 용 | 4 | 5.5% | |
| 베 | 2 | 2.7% | |
| 나 | 2 | 2.7% | |
| 니 | 2 | 2.7% | |
| 김 | 2 | 2.7% | |
| 인 | 2 | 2.7% | |
| ᆺ | 2 | 2.7% | |
| 몬 | 1 | 1.4% | |
| 누 | 1 | 1.4% | |
| 사 | 1 | 1.4% | |
| 랑 | 1 | 1.4% | |
| 케 | 1 | 1.4% | |
| 트 | 1 | 1.4% | |
| 알 | 1 | 1.4% | |
| 렉 | 1 | 1.4% | |
| 산 | 1 | 1.4% | |
| 더 | 1 | 1.4% | |
| 대 | 1 | 1.4% | |
| 왕 | 1 | 1.4% | |
| 한 | 1 | 1.4% | |
| 국 | 1 | 1.4% | |
| Other values (27) | 27 | 37.0% |
Most frequent Lao characters
| Value | Count | Frequency (%) | |
| ໐ | 2 | 33.3% | |
| ບ | 1 | 16.7% | |
| ຯ | 1 | 16.7% | |
| ໒ | 1 | 16.7% | |
| ຖ | 1 | 16.7% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 9 | 8.3% | |
| ర | 9 | 8.3% | |
| ీ | 8 | 7.4% | |
| ు | 7 | 6.5% | |
| శ | 5 | 4.6% | |
| ి | 5 | 4.6% | |
| వ | 5 | 4.6% | |
| ప | 5 | 4.6% | |
| ా | 5 | 4.6% | |
| బ | 5 | 4.6% | |
| ో | 4 | 3.7% | |
| హ | 4 | 3.7% | |
| స | 3 | 2.8% | |
| ే | 3 | 2.8% | |
| ె | 3 | 2.8% | |
| డ | 3 | 2.8% | |
| ం | 3 | 2.8% | |
| గ | 2 | 1.9% | |
| ల | 2 | 1.9% | |
| న | 2 | 1.9% | |
| క | 2 | 1.9% | |
| చ | 2 | 1.9% | |
| ూ | 2 | 1.9% | |
| ఌ | 2 | 1.9% | |
| ద | 2 | 1.9% | |
| Other values (6) | 6 | 5.6% |
Most frequent Malayalam characters
| Value | Count | Frequency (%) | |
| ക | 6 | 14.0% | |
| ു | 4 | 9.3% | |
| ണ | 4 | 9.3% | |
| ന | 3 | 7.0% | |
| അ | 3 | 7.0% | |
| ് | 3 | 7.0% | |
| ര | 2 | 4.7% | |
| ൃ | 2 | 4.7% | |
| ഷ | 2 | 4.7% | |
| ൻ | 2 | 4.7% | |
| ാ | 2 | 4.7% | |
| യ | 2 | 4.7% | |
| ർ | 2 | 4.7% | |
| സ | 1 | 2.3% | |
| ീ | 1 | 2.3% | |
| ച | 1 | 2.3% | |
| വ | 1 | 2.3% | |
| ധ | 1 | 2.3% | |
| ം | 1 | 2.3% |
Most frequent Tibetan characters
| Value | Count | Frequency (%) | |
| ༄ | 1 | 14.3% | |
| ༆ | 1 | 14.3% | |
| ࿐ | 1 | 14.3% | |
| ཽ | 1 | 14.3% | |
| ༅ | 1 | 14.3% | |
| ༺ | 1 | 14.3% | |
| ༻ | 1 | 14.3% |
Most frequent Bamum characters
| Value | Count | Frequency (%) | |
| 𖤐 | 2 | 100.0% |
Most frequent Tifinagh characters
| Value | Count | Frequency (%) | |
| ⵉ | 6 | 24.0% | |
| ⵏ | 4 | 16.0% | |
| ⴻ | 4 | 16.0% | |
| ⵣ | 3 | 12.0% | |
| ⵃ | 2 | 8.0% | |
| ⵛ | 2 | 8.0% | |
| ⵕ | 2 | 8.0% | |
| ⵎ | 2 | 8.0% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꭵ | 15 | 50.0% | |
| Ꭹ | 4 | 13.3% | |
| Ꭿ | 2 | 6.7% | |
| Ꮎ | 2 | 6.7% | |
| Ꮗ | 2 | 6.7% | |
| Ꭲ | 1 | 3.3% | |
| Ꮖ | 1 | 3.3% | |
| Ꮶ | 1 | 3.3% | |
| Ꮻ | 1 | 3.3% | |
| Ꮯ | 1 | 3.3% |
Most frequent Tagbanwa characters
| Value | Count | Frequency (%) | |
| ᝪ | 1 | 100.0% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 98 | 15.8% | |
| ா | 58 | 9.4% | |
| ி | 56 | 9.0% | |
| ல | 42 | 6.8% | |
| ன | 39 | 6.3% | |
| ு | 36 | 5.8% | |
| ட | 36 | 5.8% | |
| ர | 32 | 5.2% | |
| க | 28 | 4.5% | |
| ஸ | 23 | 3.7% | |
| ம | 21 | 3.4% | |
| த | 19 | 3.1% | |
| வ | 19 | 3.1% | |
| ப | 14 | 2.3% | |
| ச | 12 | 1.9% | |
| ெ | 11 | 1.8% | |
| அ | 10 | 1.6% | |
| ய | 8 | 1.3% | |
| ை | 7 | 1.1% | |
| ீ | 7 | 1.1% | |
| ே | 6 | 1.0% | |
| ந | 5 | 0.8% | |
| எ | 5 | 0.8% | |
| ள | 4 | 0.6% | |
| ஷ | 3 | 0.5% | |
| Other values (10) | 20 | 3.2% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ા | 5 | 8.2% | |
| મ | 5 | 8.2% | |
| ી | 5 | 8.2% | |
| પ | 3 | 4.9% | |
| લ | 3 | 4.9% | |
| હ | 3 | 4.9% | |
| ત | 3 | 4.9% | |
| ય | 2 | 3.3% | |
| ે | 2 | 3.3% | |
| સ | 2 | 3.3% | |
| ુ | 2 | 3.3% | |
| શ | 2 | 3.3% | |
| ્ | 2 | 3.3% | |
| ો | 2 | 3.3% | |
| દ | 2 | 3.3% | |
| ર | 2 | 3.3% | |
| ૮ | 2 | 3.3% | |
| ૯ | 2 | 3.3% | |
| ઝ | 2 | 3.3% | |
| જ | 1 | 1.6% | |
| ન | 1 | 1.6% | |
| ખ | 1 | 1.6% | |
| િ | 1 | 1.6% | |
| ઇ | 1 | 1.6% | |
| ં | 1 | 1.6% | |
| Other values (4) | 4 | 6.6% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ୍ | 21 | 12.0% | |
| ି | 18 | 10.3% | |
| ନ | 12 | 6.9% | |
| ର | 11 | 6.3% | |
| ଜ | 9 | 5.1% | |
| ତ | 9 | 5.1% | |
| ୟ | 8 | 4.6% | |
| ଡ଼ | 8 | 4.6% | |
| ପ | 7 | 4.0% | |
| ା | 7 | 4.0% | |
| ୁ | 6 | 3.4% | |
| ଦ | 6 | 3.4% | |
| ୀ | 5 | 2.9% | |
| ଡ | 5 | 2.9% | |
| ଶ | 5 | 2.9% | |
| ସ | 4 | 2.3% | |
| କ | 4 | 2.3% | |
| ଓ | 4 | 2.3% | |
| ଆ | 4 | 2.3% | |
| ଣ | 3 | 1.7% | |
| ମ | 3 | 1.7% | |
| ଭ | 2 | 1.1% | |
| ବ | 2 | 1.1% | |
| ଚ | 2 | 1.1% | |
| ଞ | 1 | 0.6% | |
| Other values (9) | 9 | 5.1% |
Most frequent Sharada characters
| Value | Count | Frequency (%) | |
| 𑆳 | 2 | 22.2% | |
| 𑆮 | 1 | 11.1% | |
| 𑆴 | 1 | 11.1% | |
| 𑆑 | 1 | 11.1% | |
| 𑆱 | 1 | 11.1% | |
| 𑆫 | 1 | 11.1% | |
| 𑆽 | 1 | 11.1% | |
| 𑆤 | 1 | 11.1% |
Most frequent Kayah_Li characters
| Value | Count | Frequency (%) | |
| ꤌ | 3 | 33.3% | |
| ꤖ | 2 | 22.2% | |
| ꤠ | 1 | 11.1% | |
| ꤚ | 1 | 11.1% | |
| ꤍ | 1 | 11.1% | |
| ꤥ | 1 | 11.1% |
Most frequent Javanese characters
| Value | Count | Frequency (%) | |
| ꧁ | 2 | 66.7% | |
| ꧂ | 1 | 33.3% |
Most frequent Balinese characters
| Value | Count | Frequency (%) | |
| ᭄ | 2 | 100.0% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| მ | 3 | 42.9% | |
| ღ | 2 | 28.6% | |
| ყ | 1 | 14.3% | |
| ო | 1 | 14.3% |
Most frequent Khmer characters
| Value | Count | Frequency (%) | |
| អ | 3 | 11.1% | |
| ៊ | 3 | 11.1% | |
| ូ | 3 | 11.1% | |
| រ | 3 | 11.1% | |
| ិ | 3 | 11.1% | |
| ទ | 3 | 11.1% | |
| ្ | 3 | 11.1% | |
| ឋ | 3 | 11.1% | |
| ី | 3 | 11.1% |
Most frequent Cuneiform characters
| Value | Count | Frequency (%) | |
| 𒇷 | 1 | 33.3% | |
| 𒁯 | 1 | 33.3% | |
| 𒅗 | 1 | 33.3% |
Most frequent Tai_Tham characters
| Value | Count | Frequency (%) | |
| ᪥ | 2 | 100.0% |
Most frequent Old_South_Arabian characters
| Value | Count | Frequency (%) | |
| 𐩱 | 3 | 25.0% | |
| 𐩬 | 2 | 16.7% | |
| 𐩡 | 2 | 16.7% | |
| 𐩴 | 1 | 8.3% | |
| 𐩤 | 1 | 8.3% | |
| 𐩢 | 1 | 8.3% | |
| 𐩷 | 1 | 8.3% | |
| 𐩺 | 1 | 8.3% |
Most frequent Bopomofo characters
| Value | Count | Frequency (%) | |
| ㄥ | 1 | 50.0% | |
| ㄖ | 1 | 50.0% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| သ | 1 | 50.0% | |
| ူ | 1 | 50.0% |
Most frequent Runic characters
| Value | Count | Frequency (%) | |
| ᚱ | 10 | 28.6% | |
| ᛁ | 10 | 28.6% | |
| ᚷ | 5 | 14.3% | |
| ᛗ | 5 | 14.3% | |
| ᚾ | 5 | 14.3% |
Most frequent Thaana characters
| Value | Count | Frequency (%) | |
| ވ | 1 | 16.7% | |
| ަ | 1 | 16.7% | |
| އ | 1 | 16.7% | |
| ް | 1 | 16.7% | |
| ޑ | 1 | 16.7% | |
| ެ | 1 | 16.7% |
Most frequent Limbu characters
| Value | Count | Frequency (%) | |
| ᥅ | 2 | 100.0% |
Most frequent Tai_Viet characters
| Value | Count | Frequency (%) | |
| ꪖ | 3 | 60.0% | |
| ꪮ | 1 | 20.0% | |
| ꫀ | 1 | 20.0% |
Most frequent New_Tai_Lue characters
| Value | Count | Frequency (%) | |
| ᦓ | 1 | 50.0% | |
| ᦔ | 1 | 50.0% |
Most frequent Tai_Le characters
| Value | Count | Frequency (%) | |
| ᥴ | 1 | 100.0% |
Most frequent Tagalog characters
| Value | Count | Frequency (%) | |
| ᜈ | 2 | 20.0% | |
| ᜔ | 2 | 20.0% | |
| ᜁ | 1 | 10.0% | |
| ᜇ | 1 | 10.0% | |
| ᜏ | 1 | 10.0% | |
| ᜋ | 1 | 10.0% | |
| ᜅ | 1 | 10.0% | |
| ᜐ | 1 | 10.0% |
Most frequent Yi characters
| Value | Count | Frequency (%) | |
| ꒱ | 1 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 627090 | 94.8% | |
| None | 7057 | 1.1% | |
| Enclosed Alphanum Sup | 5964 | 0.9% | |
| Devanagari | 3075 | 0.5% | |
| Math Alphanum | 2879 | 0.4% | |
| Arabic | 2776 | 0.4% | |
| VS | 1436 | 0.2% | |
| CJK | 1169 | 0.2% | |
| Misc Symbols | 984 | 0.1% | |
| Latin 1 Sup | 957 | 0.1% | |
| Punctuation | 791 | 0.1% | |
| Tags | 762 | 0.1% | |
| Cyrillic | 715 | 0.1% | |
| Emoticons | 653 | 0.1% | |
| Tamil | 619 | 0.1% | |
| Dingbats | 575 | 0.1% | |
| Diacriticals | 459 | 0.1% | |
| Phonetic Ext | 263 | < 0.1% | |
| IPA Ext | 245 | < 0.1% | |
| Letterlike Symbols | 215 | < 0.1% | |
| Kannada | 213 | < 0.1% | |
| Latin Ext A | 193 | < 0.1% | |
| Katakana | 184 | < 0.1% | |
| Oriya | 175 | < 0.1% | |
| Bengali | 172 | < 0.1% | |
| Other values (64) | 1557 | 0.2% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 55991 | 8.9% | |
| 55588 | 8.9% | ||
| e | 46246 | 7.4% | |
| i | 40686 | 6.5% | |
| n | 36151 | 5.8% | |
| r | 32669 | 5.2% | |
| o | 30137 | 4.8% | |
| s | 24274 | 3.9% | |
| t | 24086 | 3.8% | |
| l | 22904 | 3.7% | |
| h | 19074 | 3.0% | |
| u | 15092 | 2.4% | |
| d | 12946 | 2.1% | |
| m | 11312 | 1.8% | |
| S | 11175 | 1.8% | |
| c | 11144 | 1.8% | |
| y | 9855 | 1.6% | |
| M | 8836 | 1.4% | |
| k | 8710 | 1.4% | |
| A | 8454 | 1.3% | |
| g | 7259 | 1.2% | |
| C | 7102 | 1.1% | |
| T | 6998 | 1.1% | |
| D | 6623 | 1.1% | |
| R | 6099 | 1.0% | |
| Other values (68) | 107679 | 17.2% |
Most frequent Enclosed Alphanum Sup characters
| Value | Count | Frequency (%) | |
| 🇺 | 734 | 12.3% | |
| 🇮 | 716 | 12.0% | |
| 🇳 | 696 | 11.7% | |
| 🇪 | 489 | 8.2% | |
| 🇸 | 403 | 6.8% | |
| 🇬 | 320 | 5.4% | |
| 🇧 | 316 | 5.3% | |
| 🇦 | 277 | 4.6% | |
| 🇨 | 229 | 3.8% | |
| 🇷 | 205 | 3.4% | |
| 🇰 | 182 | 3.1% | |
| 🇵 | 168 | 2.8% | |
| 🇭 | 161 | 2.7% | |
| 🇱 | 153 | 2.6% | |
| 🇲 | 127 | 2.1% | |
| 🇹 | 116 | 1.9% | |
| 🇿 | 113 | 1.9% | |
| 🇼 | 72 | 1.2% | |
| 🇩 | 67 | 1.1% | |
| 🇫 | 56 | 0.9% | |
| 🇾 | 38 | 0.6% | |
| 🇯 | 30 | 0.5% | |
| 🇴 | 28 | 0.5% | |
| 🅰 | 27 | 0.5% | |
| 🇽 | 16 | 0.3% | |
| Other values (55) | 225 | 3.8% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| 💙 | 559 | 7.9% | |
| 🌈 | 323 | 4.6% | |
| 🌍 | 282 | 4.0% | |
| 🏳 | 256 | 3.6% | |
| 🌊 | 207 | 2.9% | |
| 🏴 | 177 | 2.5% | |
| 🏻 | 148 | 2.1% | |
| 🌹 | 102 | 1.4% | |
| 💛 | 96 | 1.4% | |
| 💉 | 93 | 1.3% | |
| 💜 | 92 | 1.3% | |
| 💚 | 90 | 1.3% | |
| 🌎 | 89 | 1.3% | |
| 🕷 | 87 | 1.2% | |
| 🕊 | 68 | 1.0% | |
| 🧡 | 58 | 0.8% | |
| 🌐 | 55 | 0.8% | |
| 🚩 | 53 | 0.8% | |
| 🌱 | 52 | 0.7% | |
| 🦋 | 52 | 0.7% | |
| α | 50 | 0.7% | |
| 🌺 | 47 | 0.7% | |
| 🐝 | 46 | 0.7% | |
| 👑 | 45 | 0.6% | |
| 🌻 | 45 | 0.6% | |
| Other values (633) | 3885 | 55.1% |
Most frequent Dingbats characters
| Value | Count | Frequency (%) | |
| ❤ | 112 | 19.5% | |
| ✨ | 79 | 13.7% | |
| ❄ | 56 | 9.7% | |
| ✊ | 53 | 9.2% | |
| ➡ | 41 | 7.1% | |
| ✌ | 26 | 4.5% | |
| ✝ | 21 | 3.7% | |
| ❌ | 21 | 3.7% | |
| ✈ | 16 | 2.8% | |
| ➐ | 16 | 2.8% | |
| ✋ | 16 | 2.8% | |
| ✒ | 11 | 1.9% | |
| ❓ | 11 | 1.9% | |
| ➕ | 11 | 1.9% | |
| ✡ | 10 | 1.7% | |
| ✍ | 9 | 1.6% | |
| ❣ | 9 | 1.6% | |
| ✪ | 7 | 1.2% | |
| ✾ | 5 | 0.9% | |
| ✳ | 5 | 0.9% | |
| ❁ | 4 | 0.7% | |
| ❃ | 2 | 0.3% | |
| ✖ | 2 | 0.3% | |
| ✫ | 2 | 0.3% | |
| ✵ | 2 | 0.3% | |
| Other values (19) | 28 | 4.9% |
Most frequent VS characters
| Value | Count | Frequency (%) | |
| ️ | 1387 | 96.6% | |
| ︎ | 49 | 3.4% |
Most frequent Misc Symbols characters
| Value | Count | Frequency (%) | |
| ♥ | 65 | 6.6% | |
| ♀ | 54 | 5.5% | |
| ☮ | 51 | 5.2% | |
| ♂ | 49 | 5.0% | |
| ☠ | 48 | 4.9% | |
| ☆ | 42 | 4.3% | |
| ⚖ | 37 | 3.8% | |
| ⚕ | 35 | 3.6% | |
| ☀ | 33 | 3.4% | |
| ☕ | 32 | 3.3% | |
| ♿ | 30 | 3.0% | |
| ☘ | 26 | 2.6% | |
| ★ | 25 | 2.5% | |
| ☪ | 23 | 2.3% | |
| ⚡ | 21 | 2.1% | |
| ⚜ | 20 | 2.0% | |
| ⚒ | 19 | 1.9% | |
| ♡ | 17 | 1.7% | |
| ☭ | 17 | 1.7% | |
| ⚔ | 16 | 1.6% | |
| ⚘ | 12 | 1.2% | |
| ⚾ | 12 | 1.2% | |
| ⚽ | 12 | 1.2% | |
| ☯ | 12 | 1.2% | |
| ⚓ | 12 | 1.2% | |
| Other values (71) | 264 | 26.8% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 435 | 15.7% | |
| ل | 207 | 7.5% | |
| ن | 206 | 7.4% | |
| م | 190 | 6.8% | |
| ی | 173 | 6.2% | |
| و | 160 | 5.8% | |
| ر | 146 | 5.3% | |
| ي | 140 | 5.0% | |
| د | 121 | 4.4% | |
| س | 104 | 3.7% | |
| ب | 103 | 3.7% | |
| ع | 73 | 2.6% | |
| ح | 60 | 2.2% | |
| ج | 55 | 2.0% | |
| ش | 40 | 1.4% | |
| ز | 38 | 1.4% | |
| ف | 37 | 1.3% | |
| ه | 36 | 1.3% | |
| ت | 36 | 1.3% | |
| ٹ | 35 | 1.3% | |
| غ | 34 | 1.2% | |
| ق | 30 | 1.1% | |
| ة | 29 | 1.0% | |
| َ | 27 | 1.0% | |
| ہ | 25 | 0.9% | |
| Other values (35) | 236 | 8.5% |
Most frequent Math Alphanum characters
| Value | Count | Frequency (%) | |
| 𝐚 | 74 | 2.6% | |
| 𝓪 | 47 | 1.6% | |
| 𝖆 | 47 | 1.6% | |
| 𝕚 | 44 | 1.5% | |
| 𝕝 | 44 | 1.5% | |
| 𝕟 | 43 | 1.5% | |
| 𝓮 | 42 | 1.5% | |
| 𝓲 | 39 | 1.4% | |
| 𝖓 | 39 | 1.4% | |
| 𝕖 | 38 | 1.3% | |
| 𝕒 | 37 | 1.3% | |
| 𝓵 | 35 | 1.2% | |
| 𝐡 | 33 | 1.1% | |
| 𝕙 | 32 | 1.1% | |
| 𝖔 | 31 | 1.1% | |
| 𝐧 | 31 | 1.1% | |
| 𝖊 | 30 | 1.0% | |
| 𝗲 | 27 | 0.9% | |
| 𝖗 | 26 | 0.9% | |
| 𝕠 | 26 | 0.9% | |
| 𝐢 | 25 | 0.9% | |
| 𝖙 | 25 | 0.9% | |
| 𝚎 | 24 | 0.8% | |
| 𝕤 | 24 | 0.8% | |
| 𝗮 | 23 | 0.8% | |
| Other values (377) | 1993 | 69.2% |
Most frequent Emoticons characters
| Value | Count | Frequency (%) | |
| 😷 | 393 | 60.2% | |
| 🙏 | 54 | 8.3% | |
| 🙂 | 29 | 4.4% | |
| 😎 | 25 | 3.8% | |
| 🙈 | 15 | 2.3% | |
| 🙉 | 11 | 1.7% | |
| 🙊 | 11 | 1.7% | |
| 😍 | 11 | 1.7% | |
| 😉 | 11 | 1.7% | |
| 😡 | 8 | 1.2% | |
| 🙌 | 7 | 1.1% | |
| 😈 | 7 | 1.1% | |
| 😆 | 7 | 1.1% | |
| 😃 | 5 | 0.8% | |
| 😊 | 4 | 0.6% | |
| 🙃 | 4 | 0.6% | |
| 😺 | 4 | 0.6% | |
| 😇 | 4 | 0.6% | |
| 😻 | 4 | 0.6% | |
| 😀 | 3 | 0.5% | |
| 🙋 | 3 | 0.5% | |
| 😁 | 3 | 0.5% | |
| 😬 | 3 | 0.5% | |
| 😼 | 2 | 0.3% | |
| 🙅 | 2 | 0.3% | |
| Other values (17) | 23 | 3.5% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| | 483 | 61.1% | |
| ’ | 85 | 10.7% | |
| • | 57 | 7.2% | |
| ” | 42 | 5.3% | |
| “ | 36 | 4.6% | |
| | 29 | 3.7% | |
| ‘ | 9 | 1.1% | |
| | 8 | 1.0% | |
| | 8 | 1.0% | |
| † | 7 | 0.9% | |
| | 7 | 0.9% | |
| ‿ | 5 | 0.6% | |
| – | 5 | 0.6% | |
| — | 4 | 0.5% | |
| ⁎ | 2 | 0.3% | |
| ‾ | 2 | 0.3% | |
| ⁂ | 1 | 0.1% | |
| ‑ | 1 | 0.1% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| é | 128 | 13.4% | |
| á | 120 | 12.5% | |
| í | 72 | 7.5% | |
| ® | 68 | 7.1% | |
| « | 42 | 4.4% | |
| » | 42 | 4.4% | |
| ñ | 39 | 4.1% | |
| ö | 35 | 3.7% | |
| ó | 32 | 3.3% | |
| © | 28 | 2.9% | |
| ü | 27 | 2.8% | |
| ° | 24 | 2.5% | |
| ä | 18 | 1.9% | |
| ¯ | 14 | 1.5% | |
| § | 13 | 1.4% | |
| ò | 13 | 1.4% | |
| Ó | 12 | 1.3% | |
| ë | 11 | 1.1% | |
| è | 11 | 1.1% | |
| ç | 9 | 0.9% | |
| ¡ | 9 | 0.9% | |
| ú | 9 | 0.9% | |
| ø | 9 | 0.9% | |
| É | 8 | 0.8% | |
| ¤ | 8 | 0.8% | |
| Other values (49) | 156 | 16.3% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 沈 | 119 | 10.2% | |
| 诗 | 119 | 10.2% | |
| 伟 | 119 | 10.2% | |
| 中 | 65 | 5.6% | |
| 新 | 57 | 4.9% | |
| 国 | 56 | 4.8% | |
| 闻 | 55 | 4.7% | |
| 网 | 55 | 4.7% | |
| 李 | 23 | 2.0% | |
| 碧 | 23 | 2.0% | |
| 建 | 23 | 2.0% | |
| 木 | 12 | 1.0% | |
| 根 | 11 | 0.9% | |
| 渕 | 11 | 0.9% | |
| 猛 | 11 | 0.9% | |
| 彡 | 8 | 0.7% | |
| 華 | 8 | 0.7% | |
| 美 | 8 | 0.7% | |
| 大 | 7 | 0.6% | |
| 南 | 7 | 0.6% | |
| 渡 | 7 | 0.6% | |
| 部 | 7 | 0.6% | |
| 智 | 7 | 0.6% | |
| 子 | 7 | 0.6% | |
| 蘋 | 6 | 0.5% | |
| Other values (183) | 338 | 28.9% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ್ | 30 | 14.1% | |
| ರ | 20 | 9.4% | |
| ನ | 12 | 5.6% | |
| ಾ | 12 | 5.6% | |
| ಿ | 11 | 5.2% | |
| ವ | 11 | 5.2% | |
| ಶ | 8 | 3.8% | |
| ಮ | 8 | 3.8% | |
| ೆ | 7 | 3.3% | |
| ೀ | 7 | 3.3% | |
| ು | 7 | 3.3% | |
| ಲ | 6 | 2.8% | |
| ಬ | 5 | 2.3% | |
| ಂ | 5 | 2.3% | |
| ಗ | 5 | 2.3% | |
| ಕ | 5 | 2.3% | |
| ತ | 4 | 1.9% | |
| ಸ | 4 | 1.9% | |
| ಜ | 4 | 1.9% | |
| ಯ | 4 | 1.9% | |
| ೇ | 4 | 1.9% | |
| ಹ | 4 | 1.9% | |
| ಳ | 3 | 1.4% | |
| ೂ | 3 | 1.4% | |
| ೈ | 3 | 1.4% | |
| Other values (13) | 21 | 9.9% |
Most frequent PUA characters
| Value | Count | Frequency (%) | |
| | 22 | 42.3% | |
| | 10 | 19.2% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% | |
| | 5 | 9.6% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| י | 15 | 12.2% | |
| א | 12 | 9.8% | |
| ל | 10 | 8.1% | |
| ר | 10 | 8.1% | |
| פ | 8 | 6.5% | |
| ס | 8 | 6.5% | |
| ב | 8 | 6.5% | |
| ה | 6 | 4.9% | |
| נ | 6 | 4.9% | |
| ד | 6 | 4.9% | |
| מ | 5 | 4.1% | |
| ש | 4 | 3.3% | |
| ן | 4 | 3.3% | |
| ת | 4 | 3.3% | |
| ָ | 3 | 2.4% | |
| ִ | 2 | 1.6% | |
| ע | 1 | 0.8% | |
| ט | 1 | 0.8% | |
| צ | 1 | 0.8% | |
| ק | 1 | 0.8% | |
| ץ | 1 | 0.8% | |
| ׁ | 1 | 0.8% | |
| ֵ | 1 | 0.8% | |
| ֲ | 1 | 0.8% | |
| ַ | 1 | 0.8% | |
| Other values (3) | 3 | 2.4% |
Most frequent Ethiopic characters
| Value | Count | Frequency (%) | |
| ን | 5 | 12.5% | |
| ድ | 5 | 12.5% | |
| ፌ | 4 | 10.0% | |
| ሊ | 4 | 10.0% | |
| ፔ | 4 | 10.0% | |
| አ | 4 | 10.0% | |
| ሬ | 4 | 10.0% | |
| ስ | 4 | 10.0% | |
| ር | 2 | 5.0% | |
| ሃ | 1 | 2.5% | |
| ኮ | 1 | 2.5% | |
| ሂ | 1 | 2.5% | |
| ዱ | 1 | 2.5% |
Most frequent Modifier Tone Letters characters
| Value | Count | Frequency (%) | |
| ꜀ | 4 | 100.0% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ン | 25 | 13.6% | |
| ト | 12 | 6.5% | |
| モ | 11 | 6.0% | |
| ケ | 11 | 6.0% | |
| シ | 11 | 6.0% | |
| リ | 11 | 6.0% | |
| ヤ | 10 | 5.4% | |
| カ | 10 | 5.4% | |
| テ | 10 | 5.4% | |
| ウ | 9 | 4.9% | |
| ィ | 9 | 4.9% | |
| ツ | 7 | 3.8% | |
| ジ | 5 | 2.7% | |
| ミ | 4 | 2.2% | |
| マ | 4 | 2.2% | |
| ー | 4 | 2.2% | |
| エ | 3 | 1.6% | |
| イ | 3 | 1.6% | |
| ッ | 2 | 1.1% | |
| ク | 2 | 1.1% | |
| ハ | 2 | 1.1% | |
| ナ | 2 | 1.1% | |
| ヂ | 2 | 1.1% | |
| ド | 2 | 1.1% | |
| ョ | 2 | 1.1% | |
| Other values (11) | 11 | 6.0% |
Most frequent Latin Ext A characters
| Value | Count | Frequency (%) | |
| ć | 24 | 12.4% | |
| Ć | 14 | 7.3% | |
| ı | 13 | 6.7% | |
| č | 12 | 6.2% | |
| İ | 11 | 5.7% | |
| ā | 11 | 5.7% | |
| ř | 10 | 5.2% | |
| ē | 9 | 4.7% | |
| ş | 9 | 4.7% | |
| ł | 9 | 4.7% | |
| Š | 8 | 4.1% | |
| Ş | 6 | 3.1% | |
| š | 6 | 3.1% | |
| Č | 5 | 2.6% | |
| ğ | 4 | 2.1% | |
| ń | 3 | 1.6% | |
| ū | 3 | 1.6% | |
| Ł | 3 | 1.6% | |
| Ţ | 2 | 1.0% | |
| Ż | 2 | 1.0% | |
| Ī | 2 | 1.0% | |
| ď | 1 | 0.5% | |
| Ą | 1 | 0.5% | |
| Ğ | 1 | 0.5% | |
| ō | 1 | 0.5% | |
| Other values (23) | 23 | 11.9% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| и | 78 | 10.9% | |
| н | 69 | 9.7% | |
| т | 59 | 8.3% | |
| к | 47 | 6.6% | |
| у | 42 | 5.9% | |
| С | 35 | 4.9% | |
| п | 34 | 4.8% | |
| с | 33 | 4.6% | |
| о | 30 | 4.2% | |
| а | 25 | 3.5% | |
| в | 22 | 3.1% | |
| м | 20 | 2.8% | |
| е | 18 | 2.5% | |
| є | 16 | 2.2% | |
| л | 15 | 2.1% | |
| я | 14 | 2.0% | |
| р | 12 | 1.7% | |
| А | 11 | 1.5% | |
| ҉ | 9 | 1.3% | |
| ь | 9 | 1.3% | |
| ѕ | 9 | 1.3% | |
| Р | 8 | 1.1% | |
| д | 8 | 1.1% | |
| К | 7 | 1.0% | |
| П | 6 | 0.8% | |
| Other values (32) | 79 | 11.0% |
Most frequent Letterlike Symbols characters
| Value | Count | Frequency (%) | |
| ™ | 154 | 71.6% | |
| ℙ | 18 | 8.4% | |
| ℓ | 15 | 7.0% | |
| ℝ | 5 | 2.3% | |
| ℹ | 4 | 1.9% | |
| ℭ | 2 | 0.9% | |
| ℘ | 2 | 0.9% | |
| ℰ | 2 | 0.9% | |
| ℂ | 2 | 0.9% | |
| ℳ | 1 | 0.5% | |
| ℐ | 1 | 0.5% | |
| ℯ | 1 | 0.5% | |
| ℒ | 1 | 0.5% | |
| ℴ | 1 | 0.5% | |
| ℊ | 1 | 0.5% | |
| ℎ | 1 | 0.5% | |
| ℍ | 1 | 0.5% | |
| ℕ | 1 | 0.5% | |
| ⅈ | 1 | 0.5% | |
| ℌ | 1 | 0.5% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 288 | 9.4% | |
| र | 263 | 8.6% | |
| ् | 212 | 6.9% | |
| ि | 194 | 6.3% | |
| स | 135 | 4.4% | |
| त | 129 | 4.2% | |
| म | 128 | 4.2% | |
| न | 126 | 4.1% | |
| क | 123 | 4.0% | |
| ी | 118 | 3.8% | |
| व | 107 | 3.5% | |
| प | 89 | 2.9% | |
| ं | 87 | 2.8% | |
| द | 86 | 2.8% | |
| य | 76 | 2.5% | |
| ु | 72 | 2.3% | |
| श | 71 | 2.3% | |
| ह | 68 | 2.2% | |
| े | 67 | 2.2% | |
| ज | 67 | 2.2% | |
| ल | 63 | 2.0% | |
| ो | 48 | 1.6% | |
| अ | 38 | 1.2% | |
| ग | 35 | 1.1% | |
| ू | 34 | 1.1% | |
| Other values (40) | 351 | 11.4% |
Most frequent Tags characters
| Value | Count | Frequency (%) | |
| | 152 | 19.9% | |
| | 127 | 16.7% | |
| | 127 | 16.7% | |
| | 102 | 13.4% | |
| | 62 | 8.1% | |
| | 62 | 8.1% | |
| | 40 | 5.2% | |
| | 40 | 5.2% | |
| | 25 | 3.3% | |
| | 25 | 3.3% |
Most frequent Latin Ext Additional characters
| Value | Count | Frequency (%) | |
| ḕ | 4 | 26.7% | |
| ệ | 2 | 13.3% | |
| Ṙ | 2 | 13.3% | |
| ḃ | 2 | 13.3% | |
| Ḷ | 2 | 13.3% | |
| ợ | 1 | 6.7% | |
| ẹ | 1 | 6.7% | |
| ể | 1 | 6.7% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| เ | 8 | 6.8% | |
| ร | 7 | 5.9% | |
| า | 7 | 5.9% | |
| น | 7 | 5.9% | |
| ม | 6 | 5.1% | |
| ก | 5 | 4.2% | |
| ย | 5 | 4.2% | |
| ล | 4 | 3.4% | |
| ๏ | 4 | 3.4% | |
| ี | 4 | 3.4% | |
| ่ | 4 | 3.4% | |
| ว | 4 | 3.4% | |
| ง | 3 | 2.5% | |
| ั | 3 | 2.5% | |
| ็ | 3 | 2.5% | |
| อ | 3 | 2.5% | |
| ้ | 3 | 2.5% | |
| ช | 3 | 2.5% | |
| ภ | 2 | 1.7% | |
| ุ | 2 | 1.7% | |
| ์ | 2 | 1.7% | |
| ท | 2 | 1.7% | |
| ส | 2 | 1.7% | |
| ด | 2 | 1.7% | |
| ต | 2 | 1.7% | |
| Other values (18) | 21 | 17.8% |
Most frequent Arabic PF A characters
| Value | Count | Frequency (%) | |
| ﮼ | 9 | 100.0% |
Most frequent IPA Ext characters
| Value | Count | Frequency (%) | |
| ɴ | 33 | 13.5% | |
| ʀ | 30 | 12.2% | |
| ɪ | 23 | 9.4% | |
| ʟ | 19 | 7.8% | |
| ɑ | 17 | 6.9% | |
| ɾ | 15 | 6.1% | |
| ʏ | 12 | 4.9% | |
| ʙ | 9 | 3.7% | |
| ʜ | 8 | 3.3% | |
| ɔ | 7 | 2.9% | |
| ɢ | 6 | 2.4% | |
| ʍ | 6 | 2.4% | |
| ʇ | 5 | 2.0% | |
| ɛ | 5 | 2.0% | |
| ɨ | 4 | 1.6% | |
| ɐ | 4 | 1.6% | |
| ɱ | 4 | 1.6% | |
| ɥ | 4 | 1.6% | |
| ʕ | 4 | 1.6% | |
| ʔ | 4 | 1.6% | |
| ʅ | 4 | 1.6% | |
| ɯ | 3 | 1.2% | |
| ʎ | 3 | 1.2% | |
| ɹ | 3 | 1.2% | |
| ʞ | 2 | 0.8% | |
| Other values (8) | 11 | 4.5% |
Most frequent Phonetic Ext characters
| Value | Count | Frequency (%) | |
| ᴇ | 57 | 21.7% | |
| ᴀ | 29 | 11.0% | |
| ᵃ | 20 | 7.6% | |
| ᴛ | 19 | 7.2% | |
| ᴏ | 17 | 6.5% | |
| ᴠ | 17 | 6.5% | |
| ᴍ | 13 | 4.9% | |
| ᴄ | 10 | 3.8% | |
| ᵒ | 7 | 2.7% | |
| ᵉ | 7 | 2.7% | |
| ᴋ | 7 | 2.7% | |
| ᴡ | 6 | 2.3% | |
| ᵗ | 6 | 2.3% | |
| ᴾ | 5 | 1.9% | |
| ᴴ | 5 | 1.9% | |
| ᴮ | 4 | 1.5% | |
| ᴱ | 4 | 1.5% | |
| ᴅ | 4 | 1.5% | |
| ᵈ | 4 | 1.5% | |
| ᴜ | 4 | 1.5% | |
| ᴥ | 4 | 1.5% | |
| ᵍ | 3 | 1.1% | |
| ᴚ | 3 | 1.1% | |
| ᴗ | 2 | 0.8% | |
| ᵐ | 2 | 0.8% | |
| Other values (4) | 4 | 1.5% |
Most frequent Arrows characters
| Value | Count | Frequency (%) | |
| ↙ | 6 | 75.0% | |
| ↔ | 1 | 12.5% | |
| ↗ | 1 | 12.5% |
Most frequent Diacriticals characters
| Value | Count | Frequency (%) | |
| ͟ | 260 | 56.6% | |
| ̶ | 8 | 1.7% | |
| ̴ | 8 | 1.7% | |
| ̵ | 7 | 1.5% | |
| ̞ | 5 | 1.1% | |
| ̄ | 5 | 1.1% | |
| ͡ | 5 | 1.1% | |
| ͂ | 5 | 1.1% | |
| ͊ | 5 | 1.1% | |
| ͑ | 5 | 1.1% | |
| ̭ | 5 | 1.1% | |
| ̙ | 5 | 1.1% | |
| ́ | 4 | 0.9% | |
| ͜ | 4 | 0.9% | |
| ͛ | 4 | 0.9% | |
| ͪ | 4 | 0.9% | |
| ̟ | 4 | 0.9% | |
| ͙ | 4 | 0.9% | |
| ̦ | 3 | 0.7% | |
| ̜ | 3 | 0.7% | |
| ̒ | 3 | 0.7% | |
| ̯ | 3 | 0.7% | |
| ̸ | 3 | 0.7% | |
| ̃ | 3 | 0.7% | |
| ̎ | 3 | 0.7% | |
| Other values (52) | 91 | 19.8% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ひ | 6 | 15.4% | |
| の | 3 | 7.7% | |
| し | 3 | 7.7% | |
| ま | 3 | 7.7% | |
| で | 2 | 5.1% | |
| き | 2 | 5.1% | |
| な | 2 | 5.1% | |
| い | 2 | 5.1% | |
| は | 2 | 5.1% | |
| す | 2 | 5.1% | |
| ね | 2 | 5.1% | |
| に | 2 | 5.1% | |
| り | 1 | 2.6% | |
| こ | 1 | 2.6% | |
| れ | 1 | 2.6% | |
| あ | 1 | 2.6% | |
| か | 1 | 2.6% | |
| げ | 1 | 2.6% | |
| た | 1 | 2.6% | |
| ん | 1 | 2.6% |
Most frequent Misc Technical characters
| Value | Count | Frequency (%) | |
| ⏳ | 9 | 36.0% | |
| ⏺ | 7 | 28.0% | |
| ⌛ | 5 | 20.0% | |
| ⌬ | 1 | 4.0% | |
| ⎊ | 1 | 4.0% | |
| ⍟ | 1 | 4.0% | |
| ⌐ | 1 | 4.0% |
Most frequent UCAS characters
| Value | Count | Frequency (%) | |
| ᗩ | 10 | 13.3% | |
| ᑎ | 8 | 10.7% | |
| ᖇ | 6 | 8.0% | |
| ᑭ | 6 | 8.0% | |
| ᒪ | 6 | 8.0% | |
| ᔕ | 5 | 6.7% | |
| ᕼ | 5 | 6.7% | |
| ᗪ | 5 | 6.7% | |
| ᒍ | 3 | 4.0% | |
| ᗰ | 3 | 4.0% | |
| ᑌ | 3 | 4.0% | |
| ᑕ | 2 | 2.7% | |
| ᘿ | 2 | 2.7% | |
| ᗯ | 2 | 2.7% | |
| ᒋ | 1 | 1.3% | |
| ᐯ | 1 | 1.3% | |
| ᕲ | 1 | 1.3% | |
| ᗴ | 1 | 1.3% | |
| ᖴ | 1 | 1.3% | |
| ᗞ | 1 | 1.3% | |
| ᗷ | 1 | 1.3% | |
| ᐠ | 1 | 1.3% | |
| ᐟ | 1 | 1.3% |
Most frequent Enclosed Alphanum characters
| Value | Count | Frequency (%) | |
| Ⓥ | 26 | 25.0% | |
| Ⓜ | 22 | 21.2% | |
| ⓔ | 5 | 4.8% | |
| ⓐ | 5 | 4.8% | |
| ⓕ | 4 | 3.8% | |
| ⓘ | 4 | 3.8% | |
| Ⓐ | 3 | 2.9% | |
| Ⓡ | 3 | 2.9% | |
| Ⓙ | 2 | 1.9% | |
| ⓡ | 2 | 1.9% | |
| Ⓒ | 2 | 1.9% | |
| ⓢ | 2 | 1.9% | |
| ⓒ | 2 | 1.9% | |
| ⓣ | 2 | 1.9% | |
| ⑤ | 2 | 1.9% | |
| ⓛ | 2 | 1.9% | |
| ⓚ | 2 | 1.9% | |
| ⓑ | 1 | 1.0% | |
| ⓜ | 1 | 1.0% | |
| ⓞ | 1 | 1.0% | |
| ⓗ | 1 | 1.0% | |
| Ⓓ | 1 | 1.0% | |
| Ⓔ | 1 | 1.0% | |
| Ⓕ | 1 | 1.0% | |
| ② | 1 | 1.0% | |
| Other values (6) | 6 | 5.8% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| ղ | 14 | 31.8% | |
| օ | 5 | 11.4% | |
| ց | 4 | 9.1% | |
| հ | 3 | 6.8% | |
| ֆ | 3 | 6.8% | |
| յ | 2 | 4.5% | |
| ք | 2 | 4.5% | |
| Ե | 2 | 4.5% | |
| Տ | 2 | 4.5% | |
| Յ | 2 | 4.5% | |
| ժ | 1 | 2.3% | |
| Շ | 1 | 2.3% | |
| ա | 1 | 2.3% | |
| ե | 1 | 2.3% | |
| Թ | 1 | 2.3% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| ⲥ | 8 | 22.9% | |
| ⲓ | 8 | 22.9% | |
| Ⲃ | 4 | 11.4% | |
| ⲁ | 4 | 11.4% | |
| ⲗ | 4 | 11.4% | |
| ⲟ | 4 | 11.4% | |
| Ⲑ | 2 | 5.7% | |
| Ⲁ | 1 | 2.9% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠑ | 3 | 27.3% | |
| ⠁ | 2 | 18.2% | |
| ⠝ | 1 | 9.1% | |
| ⠃ | 1 | 9.1% | |
| ⠇ | 1 | 9.1% | |
| ⠓ | 1 | 9.1% | |
| ⠍ | 1 | 9.1% | |
| ⠙ | 1 | 9.1% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ੀ | 15 | 10.0% | |
| ਿ | 15 | 10.0% | |
| ਸ | 10 | 6.7% | |
| ਪ | 9 | 6.0% | |
| ੰ | 9 | 6.0% | |
| ਰ | 9 | 6.0% | |
| ਨ | 7 | 4.7% | |
| ਘ | 7 | 4.7% | |
| ਾ | 7 | 4.7% | |
| ਲ | 7 | 4.7% | |
| ਦ | 6 | 4.0% | |
| ਆ | 5 | 3.3% | |
| ਤ | 4 | 2.7% | |
| ਗ | 4 | 2.7% | |
| ਕ | 4 | 2.7% | |
| ਟ | 3 | 2.0% | |
| ਊ | 3 | 2.0% | |
| ਜ਼ | 3 | 2.0% | |
| ਬ | 3 | 2.0% | |
| ੌ | 3 | 2.0% | |
| ਂ | 3 | 2.0% | |
| ਡ | 3 | 2.0% | |
| ਵ | 2 | 1.3% | |
| ੁ | 2 | 1.3% | |
| ਮ | 2 | 1.3% | |
| Other values (5) | 5 | 3.3% |
Most frequent Math Operators characters
| Value | Count | Frequency (%) | |
| ∴ | 18 | 45.0% | |
| ∂ | 7 | 17.5% | |
| ∆ | 4 | 10.0% | |
| ≋ | 4 | 10.0% | |
| ⋆ | 2 | 5.0% | |
| ∞ | 2 | 5.0% | |
| ∀ | 1 | 2.5% | |
| ⊕ | 1 | 2.5% | |
| ≜ | 1 | 2.5% |
Most frequent Currency Symbols characters
| Value | Count | Frequency (%) | |
| € | 12 | 52.2% | |
| ₹ | 7 | 30.4% | |
| ₿ | 2 | 8.7% | |
| ₲ | 1 | 4.3% | |
| ₮ | 1 | 4.3% |
Most frequent Latin Ext B characters
| Value | Count | Frequency (%) | |
| ǟ | 5 | 16.7% | |
| ƃ | 4 | 13.3% | |
| ȝ | 4 | 13.3% | |
| ǝ | 3 | 10.0% | |
| ƛ | 3 | 10.0% | |
| Ƹ | 2 | 6.7% | |
| Ʒ | 1 | 3.3% | |
| Ƭ | 1 | 3.3% | |
| Ɩ | 1 | 3.3% | |
| ȶ | 1 | 3.3% | |
| ƈ | 1 | 3.3% | |
| Ⱥ | 1 | 3.3% | |
| ƫ | 1 | 3.3% | |
| Ɔ | 1 | 3.3% | |
| Ɓ | 1 | 3.3% |
Most frequent Linear B Ideograms characters
| Value | Count | Frequency (%) | |
| 𐂂 | 1 | 100.0% |
Most frequent Modifier Letters characters
| Value | Count | Frequency (%) | |
| ʸ | 6 | 22.2% | |
| ʷ | 6 | 22.2% | |
| ʳ | 4 | 14.8% | |
| ˡ | 3 | 11.1% | |
| ʰ | 2 | 7.4% | |
| ˃ | 2 | 7.4% | |
| ˂ | 2 | 7.4% | |
| ʻ | 1 | 3.7% | |
| ˢ | 1 | 3.7% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| া | 24 | 14.0% | |
| ্ | 11 | 6.4% | |
| ব | 10 | 5.8% | |
| ল | 10 | 5.8% | |
| ি | 9 | 5.2% | |
| ে | 7 | 4.1% | |
| ম | 7 | 4.1% | |
| ন | 7 | 4.1% | |
| ৰ | 6 | 3.5% | |
| ত | 6 | 3.5% | |
| য | 5 | 2.9% | |
| জ | 5 | 2.9% | |
| প | 5 | 2.9% | |
| ক | 5 | 2.9% | |
| দ | 4 | 2.3% | |
| স | 4 | 2.3% | |
| ী | 4 | 2.3% | |
| র | 4 | 2.3% | |
| হ | 3 | 1.7% | |
| য় | 3 | 1.7% | |
| ণ | 3 | 1.7% | |
| ু | 3 | 1.7% | |
| গ | 3 | 1.7% | |
| ো | 2 | 1.2% | |
| শ | 2 | 1.2% | |
| Other values (16) | 20 | 11.6% |
Most frequent Latin Ext C characters
| Value | Count | Frequency (%) | |
| Ɱ | 2 | 50.0% | |
| Ɽ | 1 | 25.0% | |
| Ⱬ | 1 | 25.0% |
Most frequent Egyptian Hieroglyphs characters
| Value | Count | Frequency (%) | |
| 𓃬 | 2 | 16.7% | |
| 𓆉 | 2 | 16.7% | |
| 𓆩 | 1 | 8.3% | |
| 𓆪 | 1 | 8.3% | |
| 𓊈 | 1 | 8.3% | |
| 𓊉 | 1 | 8.3% | |
| 𓆏 | 1 | 8.3% | |
| 𓅪 | 1 | 8.3% | |
| 𓂆 | 1 | 8.3% | |
| 𓆌 | 1 | 8.3% |
Most frequent Playing Cards characters
| Value | Count | Frequency (%) | |
| 🃏 | 3 | 100.0% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 4 | 5.7% | |
| 레 | 4 | 5.7% | |
| 임 | 4 | 5.7% | |
| 택 | 4 | 5.7% | |
| 용 | 4 | 5.7% | |
| 베 | 2 | 2.9% | |
| 나 | 2 | 2.9% | |
| 니 | 2 | 2.9% | |
| 김 | 2 | 2.9% | |
| 인 | 2 | 2.9% | |
| 몬 | 1 | 1.4% | |
| 누 | 1 | 1.4% | |
| 사 | 1 | 1.4% | |
| 랑 | 1 | 1.4% | |
| 케 | 1 | 1.4% | |
| 트 | 1 | 1.4% | |
| 알 | 1 | 1.4% | |
| 렉 | 1 | 1.4% | |
| 산 | 1 | 1.4% | |
| 더 | 1 | 1.4% | |
| 대 | 1 | 1.4% | |
| 왕 | 1 | 1.4% | |
| 한 | 1 | 1.4% | |
| 국 | 1 | 1.4% | |
| 까 | 1 | 1.4% | |
| Other values (25) | 25 | 35.7% |
Most frequent Lao characters
| Value | Count | Frequency (%) | |
| ໐ | 2 | 33.3% | |
| ບ | 1 | 16.7% | |
| ຯ | 1 | 16.7% | |
| ໒ | 1 | 16.7% | |
| ຖ | 1 | 16.7% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 9 | 8.3% | |
| ర | 9 | 8.3% | |
| ీ | 8 | 7.4% | |
| ు | 7 | 6.5% | |
| శ | 5 | 4.6% | |
| ి | 5 | 4.6% | |
| వ | 5 | 4.6% | |
| ప | 5 | 4.6% | |
| ా | 5 | 4.6% | |
| బ | 5 | 4.6% | |
| ో | 4 | 3.7% | |
| హ | 4 | 3.7% | |
| స | 3 | 2.8% | |
| ే | 3 | 2.8% | |
| ె | 3 | 2.8% | |
| డ | 3 | 2.8% | |
| ం | 3 | 2.8% | |
| గ | 2 | 1.9% | |
| ల | 2 | 1.9% | |
| న | 2 | 1.9% | |
| క | 2 | 1.9% | |
| చ | 2 | 1.9% | |
| ూ | 2 | 1.9% | |
| ఌ | 2 | 1.9% | |
| ద | 2 | 1.9% | |
| Other values (6) | 6 | 5.6% |
Most frequent Malayalam characters
| Value | Count | Frequency (%) | |
| ക | 6 | 14.0% | |
| ു | 4 | 9.3% | |
| ണ | 4 | 9.3% | |
| ന | 3 | 7.0% | |
| അ | 3 | 7.0% | |
| ് | 3 | 7.0% | |
| ര | 2 | 4.7% | |
| ൃ | 2 | 4.7% | |
| ഷ | 2 | 4.7% | |
| ൻ | 2 | 4.7% | |
| ാ | 2 | 4.7% | |
| യ | 2 | 4.7% | |
| ർ | 2 | 4.7% | |
| സ | 1 | 2.3% | |
| ീ | 1 | 2.3% | |
| ച | 1 | 2.3% | |
| വ | 1 | 2.3% | |
| ധ | 1 | 2.3% | |
| ം | 1 | 2.3% |
Most frequent Geometric Shapes characters
| Value | Count | Frequency (%) | |
| ▪ | 9 | 23.7% | |
| ● | 5 | 13.2% | |
| ◉ | 5 | 13.2% | |
| ◕ | 4 | 10.5% | |
| ◎ | 4 | 10.5% | |
| ◇ | 2 | 5.3% | |
| △ | 2 | 5.3% | |
| ◻ | 2 | 5.3% | |
| ■ | 2 | 5.3% | |
| ◡ | 1 | 2.6% | |
| □ | 1 | 2.6% | |
| ▽ | 1 | 2.6% |
Most frequent Box Drawing characters
| Value | Count | Frequency (%) | |
| ╠ | 3 | 23.1% | |
| ╣ | 3 | 23.1% | |
| ╯ | 2 | 15.4% | |
| ┻ | 2 | 15.4% | |
| ╰ | 1 | 7.7% | |
| ╮ | 1 | 7.7% | |
| ━ | 1 | 7.7% |
Most frequent Tibetan characters
| Value | Count | Frequency (%) | |
| ༄ | 1 | 12.5% | |
| ༆ | 1 | 12.5% | |
| ࿐ | 1 | 12.5% | |
| ཽ | 1 | 12.5% | |
| ༅ | 1 | 12.5% | |
| ࿗ | 1 | 12.5% | |
| ༺ | 1 | 12.5% | |
| ༻ | 1 | 12.5% |
Most frequent Bamum Sup characters
| Value | Count | Frequency (%) | |
| 𖤐 | 2 | 100.0% |
Most frequent Tifinagh characters
| Value | Count | Frequency (%) | |
| ⵉ | 6 | 24.0% | |
| ⵏ | 4 | 16.0% | |
| ⴻ | 4 | 16.0% | |
| ⵣ | 3 | 12.0% | |
| ⵃ | 2 | 8.0% | |
| ⵛ | 2 | 8.0% | |
| ⵕ | 2 | 8.0% | |
| ⵎ | 2 | 8.0% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꭵ | 15 | 50.0% | |
| Ꭹ | 4 | 13.3% | |
| Ꭿ | 2 | 6.7% | |
| Ꮎ | 2 | 6.7% | |
| Ꮗ | 2 | 6.7% | |
| Ꭲ | 1 | 3.3% | |
| Ꮖ | 1 | 3.3% | |
| Ꮶ | 1 | 3.3% | |
| Ꮻ | 1 | 3.3% | |
| Ꮯ | 1 | 3.3% |
Most frequent Tagbanwa characters
| Value | Count | Frequency (%) | |
| ᝪ | 1 | 100.0% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 98 | 15.8% | |
| ா | 58 | 9.4% | |
| ி | 56 | 9.0% | |
| ல | 42 | 6.8% | |
| ன | 39 | 6.3% | |
| ு | 36 | 5.8% | |
| ட | 36 | 5.8% | |
| ர | 32 | 5.2% | |
| க | 28 | 4.5% | |
| ஸ | 23 | 3.7% | |
| ம | 21 | 3.4% | |
| த | 19 | 3.1% | |
| வ | 19 | 3.1% | |
| ப | 14 | 2.3% | |
| ச | 12 | 1.9% | |
| ெ | 11 | 1.8% | |
| அ | 10 | 1.6% | |
| ய | 8 | 1.3% | |
| ை | 7 | 1.1% | |
| ீ | 7 | 1.1% | |
| ே | 6 | 1.0% | |
| ந | 5 | 0.8% | |
| எ | 5 | 0.8% | |
| ள | 4 | 0.6% | |
| ஷ | 3 | 0.5% | |
| Other values (10) | 20 | 3.2% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ા | 5 | 8.2% | |
| મ | 5 | 8.2% | |
| ી | 5 | 8.2% | |
| પ | 3 | 4.9% | |
| લ | 3 | 4.9% | |
| હ | 3 | 4.9% | |
| ત | 3 | 4.9% | |
| ય | 2 | 3.3% | |
| ે | 2 | 3.3% | |
| સ | 2 | 3.3% | |
| ુ | 2 | 3.3% | |
| શ | 2 | 3.3% | |
| ્ | 2 | 3.3% | |
| ો | 2 | 3.3% | |
| દ | 2 | 3.3% | |
| ર | 2 | 3.3% | |
| ૮ | 2 | 3.3% | |
| ૯ | 2 | 3.3% | |
| ઝ | 2 | 3.3% | |
| જ | 1 | 1.6% | |
| ન | 1 | 1.6% | |
| ખ | 1 | 1.6% | |
| િ | 1 | 1.6% | |
| ઇ | 1 | 1.6% | |
| ં | 1 | 1.6% | |
| Other values (4) | 4 | 6.6% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ୍ | 21 | 12.0% | |
| ି | 18 | 10.3% | |
| ନ | 12 | 6.9% | |
| ର | 11 | 6.3% | |
| ଜ | 9 | 5.1% | |
| ତ | 9 | 5.1% | |
| ୟ | 8 | 4.6% | |
| ଡ଼ | 8 | 4.6% | |
| ପ | 7 | 4.0% | |
| ା | 7 | 4.0% | |
| ୁ | 6 | 3.4% | |
| ଦ | 6 | 3.4% | |
| ୀ | 5 | 2.9% | |
| ଡ | 5 | 2.9% | |
| ଶ | 5 | 2.9% | |
| ସ | 4 | 2.3% | |
| କ | 4 | 2.3% | |
| ଓ | 4 | 2.3% | |
| ଆ | 4 | 2.3% | |
| ଣ | 3 | 1.7% | |
| ମ | 3 | 1.7% | |
| ଭ | 2 | 1.1% | |
| ବ | 2 | 1.1% | |
| ଚ | 2 | 1.1% | |
| ଞ | 1 | 0.6% | |
| Other values (9) | 9 | 5.1% |
Most frequent Sharada characters
| Value | Count | Frequency (%) | |
| 𑆳 | 2 | 22.2% | |
| 𑆮 | 1 | 11.1% | |
| 𑆴 | 1 | 11.1% | |
| 𑆑 | 1 | 11.1% | |
| 𑆱 | 1 | 11.1% | |
| 𑆫 | 1 | 11.1% | |
| 𑆽 | 1 | 11.1% | |
| 𑆤 | 1 | 11.1% |
Most frequent Kayah Li characters
| Value | Count | Frequency (%) | |
| ꤌ | 3 | 33.3% | |
| ꤖ | 2 | 22.2% | |
| ꤠ | 1 | 11.1% | |
| ꤚ | 1 | 11.1% | |
| ꤍ | 1 | 11.1% | |
| ꤥ | 1 | 11.1% |
Most frequent Geometric Shapes Ext characters
| Value | Count | Frequency (%) | |
| 🟣 | 3 | 75.0% | |
| 🟥 | 1 | 25.0% |
Most frequent Javanese characters
| Value | Count | Frequency (%) | |
| ꧁ | 2 | 66.7% | |
| ꧂ | 1 | 33.3% |
Most frequent Balinese characters
| Value | Count | Frequency (%) | |
| ᭄ | 2 | 100.0% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| მ | 3 | 42.9% | |
| ღ | 2 | 28.6% | |
| ყ | 1 | 14.3% | |
| ო | 1 | 14.3% |
Most frequent Misc Math Symbols A characters
| Value | Count | Frequency (%) | |
| ⟆ | 1 | 100.0% |
Most frequent Khmer characters
| Value | Count | Frequency (%) | |
| អ | 3 | 11.1% | |
| ៊ | 3 | 11.1% | |
| ូ | 3 | 11.1% | |
| រ | 3 | 11.1% | |
| ិ | 3 | 11.1% | |
| ទ | 3 | 11.1% | |
| ្ | 3 | 11.1% | |
| ឋ | 3 | 11.1% | |
| ី | 3 | 11.1% |
Most frequent Cuneiform characters
| Value | Count | Frequency (%) | |
| 𒇷 | 1 | 33.3% | |
| 𒁯 | 1 | 33.3% | |
| 𒅗 | 1 | 33.3% |
Most frequent Latin Ext D characters
| Value | Count | Frequency (%) | |
| ꜱ | 4 | 80.0% | |
| ꜰ | 1 | 20.0% |
Most frequent Tai Tham characters
| Value | Count | Frequency (%) | |
| ᪥ | 2 | 100.0% |
Most frequent Old South Arabian characters
| Value | Count | Frequency (%) | |
| 𐩱 | 3 | 25.0% | |
| 𐩬 | 2 | 16.7% | |
| 𐩡 | 2 | 16.7% | |
| 𐩴 | 1 | 8.3% | |
| 𐩤 | 1 | 8.3% | |
| 𐩢 | 1 | 8.3% | |
| 𐩷 | 1 | 8.3% | |
| 𐩺 | 1 | 8.3% |
Most frequent Compat Jamo characters
| Value | Count | Frequency (%) | |
| ㅤ | 1 | 100.0% |
Most frequent Bopomofo characters
| Value | Count | Frequency (%) | |
| ㄥ | 1 | 50.0% | |
| ㄖ | 1 | 50.0% |
Most frequent Jamo characters
| Value | Count | Frequency (%) | |
| ᆺ | 2 | 100.0% |
Most frequent CJK Compat Forms characters
| Value | Count | Frequency (%) | |
| ︵ | 1 | 100.0% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| သ | 1 | 50.0% | |
| ူ | 1 | 50.0% |
Most frequent Runic characters
| Value | Count | Frequency (%) | |
| ᚱ | 10 | 28.6% | |
| ᛁ | 10 | 28.6% | |
| ᚷ | 5 | 14.3% | |
| ᛗ | 5 | 14.3% | |
| ᚾ | 5 | 14.3% |
Most frequent Thaana characters
| Value | Count | Frequency (%) | |
| ވ | 1 | 16.7% | |
| ަ | 1 | 16.7% | |
| އ | 1 | 16.7% | |
| ް | 1 | 16.7% | |
| ޑ | 1 | 16.7% | |
| ެ | 1 | 16.7% |
Most frequent Mahjong characters
| Value | Count | Frequency (%) | |
| 🀄 | 2 | 100.0% |
Most frequent Limbu characters
| Value | Count | Frequency (%) | |
| ᥅ | 2 | 100.0% |
Most frequent Tai Viet characters
| Value | Count | Frequency (%) | |
| ꪖ | 3 | 60.0% | |
| ꪮ | 1 | 20.0% | |
| ꫀ | 1 | 20.0% |
Most frequent New Tai Lue characters
| Value | Count | Frequency (%) | |
| ᦓ | 1 | 50.0% | |
| ᦔ | 1 | 50.0% |
Most frequent Tai Le characters
| Value | Count | Frequency (%) | |
| ᥴ | 1 | 100.0% |
Most frequent Indic Number Forms characters
| Value | Count | Frequency (%) | |
| ꠸ | 1 | 100.0% |
Most frequent Misc Math Symbols B characters
| Value | Count | Frequency (%) | |
| ⧖ | 3 | 100.0% |
Most frequent Tagalog characters
| Value | Count | Frequency (%) | |
| ᜈ | 2 | 20.0% | |
| ᜔ | 2 | 20.0% | |
| ᜁ | 1 | 10.0% | |
| ᜇ | 1 | 10.0% | |
| ᜏ | 1 | 10.0% | |
| ᜋ | 1 | 10.0% | |
| ᜅ | 1 | 10.0% | |
| ᜐ | 1 | 10.0% |
Most frequent Yi Radicals characters
| Value | Count | Frequency (%) | |
| ꒱ | 1 | 100.0% |
| Distinct | 9345 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 10365 |
| Missing (%) | 22.5% |
| Memory size | 360.0 KiB |
| India | 1237 |
|---|---|
| Toronto, Canada and Worldwide | 1026 |
| New Delhi, India | 529 |
| United States | 468 |
| London, England | 411 |
| Other values (9340) |
| Value | Count | Frequency (%) | |
| India | 1237 | 2.7% | |
| Toronto, Canada and Worldwide | 1026 | 2.2% | |
| New Delhi, India | 529 | 1.1% | |
| United States | 468 | 1.0% | |
| London, England | 411 | 0.9% | |
| Beijing, China | 399 | 0.9% | |
| Mumbai, India | 336 | 0.7% | |
| London | 307 | 0.7% | |
| Beijing | 303 | 0.7% | |
| New Delhi | 236 | 0.5% | |
| United Kingdom | 212 | 0.5% | |
| New York, NY | 188 | 0.4% | |
| Moscow, Russia | 183 | 0.4% | |
| USA | 180 | 0.4% | |
| Canada | 176 | 0.4% | |
| Los Angeles, CA | 172 | 0.4% | |
| Moscow, Russia | 170 | 0.4% | |
| Malaysia | 164 | 0.4% | |
| Washington, DC | 161 | 0.3% | |
| UK | 150 | 0.3% | |
| Mumbai | 149 | 0.3% | |
| Hong Kong | 147 | 0.3% | |
| Nairobi, Kenya | 146 | 0.3% | |
| California, USA | 144 | 0.3% | |
| Pakistan | 144 | 0.3% | |
| Other values (9320) | 27956 | 60.7% | |
| (Missing) | 10365 | 22.5% |
Frequencies of value counts
Unique
| Unique | 6277 ? |
|---|---|
| Unique (%) | 17.6% |
Histogram of lengths of the category
Length
| Max length | 129 |
|---|---|
| Median length | 11 |
| Mean length | 11.78835841 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 58747 | 10.8% | |
| n | 57651 | 10.6% | |
| 45505 | 8.4% | ||
| i | 32311 | 6.0% | |
| e | 31222 | 5.8% | |
| o | 27020 | 5.0% | |
| r | 22724 | 4.2% | |
| d | 20122 | 3.7% | |
| , | 19434 | 3.6% | |
| l | 18703 | 3.4% | |
| t | 18635 | 3.4% | |
| s | 15199 | 2.8% | |
| h | 11213 | 2.1% | |
| u | 9799 | 1.8% | |
| g | 8788 | 1.6% | |
| C | 7755 | 1.4% | |
| A | 7535 | 1.4% | |
| S | 6578 | 1.2% | |
| I | 6257 | 1.2% | |
| m | 6089 | 1.1% | |
| w | 5726 | 1.1% | |
| N | 5523 | 1.0% | |
| c | 5438 | 1.0% | |
| b | 5360 | 1.0% | |
| y | 4679 | 0.9% | |
| Other values (895) | 84947 | 15.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 375587 | 69.2% | |
| Uppercase Letter | 85339 | 15.7% | |
| Space Separator | 45509 | 8.4% | |
| Other Punctuation | 23310 | 4.3% | |
| Decimal Number | 4241 | 0.8% | |
| Other Letter | 4132 | 0.8% | |
| Other Symbol | 1946 | 0.4% | |
| Dash Punctuation | 858 | 0.2% | |
| Nonspacing Mark | 580 | 0.1% | |
| Spacing Mark | 527 | 0.1% | |
| Math Symbol | 411 | 0.1% | |
| Close Punctuation | 176 | < 0.1% | |
| Open Punctuation | 171 | < 0.1% | |
| Format | 45 | < 0.1% | |
| Modifier Symbol | 34 | < 0.1% | |
| Final Punctuation | 32 | < 0.1% | |
| Connector Punctuation | 22 | < 0.1% | |
| Control | 14 | < 0.1% | |
| Initial Punctuation | 10 | < 0.1% | |
| Modifier Letter | 8 | < 0.1% | |
| Currency Symbol | 3 | < 0.1% | |
| Enclosing Mark | 3 | < 0.1% | |
| Other Number | 2 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 7755 | 9.1% | |
| A | 7535 | 8.8% | |
| S | 6578 | 7.7% | |
| I | 6257 | 7.3% | |
| N | 5523 | 6.5% | |
| U | 4613 | 5.4% | |
| B | 4541 | 5.3% | |
| M | 4362 | 5.1% | |
| L | 4077 | 4.8% | |
| T | 4043 | 4.7% | |
| E | 3503 | 4.1% | |
| D | 3466 | 4.1% | |
| W | 3163 | 3.7% | |
| P | 3056 | 3.6% | |
| K | 2950 | 3.5% | |
| R | 1970 | 2.3% | |
| H | 1939 | 2.3% | |
| O | 1804 | 2.1% | |
| Y | 1638 | 1.9% | |
| G | 1638 | 1.9% | |
| F | 1482 | 1.7% | |
| V | 1128 | 1.3% | |
| J | 753 | 0.9% | |
| X | 493 | 0.6% | |
| Z | 417 | 0.5% | |
| Other values (68) | 655 | 0.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 58747 | 15.6% | |
| n | 57651 | 15.3% | |
| i | 32311 | 8.6% | |
| e | 31222 | 8.3% | |
| o | 27020 | 7.2% | |
| r | 22724 | 6.1% | |
| d | 20122 | 5.4% | |
| l | 18703 | 5.0% | |
| t | 18635 | 5.0% | |
| s | 15199 | 4.0% | |
| h | 11213 | 3.0% | |
| u | 9799 | 2.6% | |
| g | 8788 | 2.3% | |
| m | 6089 | 1.6% | |
| w | 5726 | 1.5% | |
| c | 5438 | 1.4% | |
| b | 5360 | 1.4% | |
| y | 4679 | 1.2% | |
| k | 4017 | 1.1% | |
| p | 3306 | 0.9% | |
| f | 2506 | 0.7% | |
| v | 2192 | 0.6% | |
| j | 1292 | 0.3% | |
| x | 797 | 0.2% | |
| z | 612 | 0.2% | |
| Other values (153) | 1439 | 0.4% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 45505 | > 99.9% | ||
| 3 | < 0.1% | ||
| 1 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 851 | 99.2% | |
| — | 4 | 0.5% | |
| – | 3 | 0.3% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 19434 | 83.4% | |
| . | 1752 | 7.5% | |
| / | 758 | 3.3% | |
| & | 297 | 1.3% | |
| # | 281 | 1.2% | |
| : | 225 | 1.0% | |
| ' | 195 | 0.8% | |
| ! | 126 | 0.5% | |
| @ | 105 | 0.5% | |
| • | 32 | 0.1% | |
| ? | 25 | 0.1% | |
| " | 18 | 0.1% | |
| ; | 15 | 0.1% | |
| * | 15 | 0.1% | |
| ، | 6 | < 0.1% | |
| % | 5 | < 0.1% | |
| ′ | 4 | < 0.1% | |
| † | 3 | < 0.1% | |
| \ | 2 | < 0.1% | |
| 。 | 2 | < 0.1% | |
| ″ | 2 | < 0.1% | |
| , | 1 | < 0.1% | |
| · | 1 | < 0.1% | |
| ¿ | 1 | < 0.1% | |
| ﹕ | 1 | < 0.1% | |
| Other values (4) | 4 | < 0.1% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| र | 275 | 6.7% | |
| ا | 263 | 6.4% | |
| त | 194 | 4.7% | |
| भ | 159 | 3.8% | |
| ل | 126 | 3.0% | |
| ल | 100 | 2.4% | |
| ر | 98 | 2.4% | |
| 京 | 94 | 2.3% | |
| 中 | 90 | 2.2% | |
| 国 | 89 | 2.2% | |
| 华 | 87 | 2.1% | |
| 人 | 87 | 2.1% | |
| 民 | 87 | 2.1% | |
| 共 | 87 | 2.1% | |
| 和 | 87 | 2.1% | |
| 北 | 85 | 2.1% | |
| م | 83 | 2.0% | |
| ة | 73 | 1.8% | |
| ت | 72 | 1.7% | |
| म | 64 | 1.5% | |
| क | 61 | 1.5% | |
| न | 59 | 1.4% | |
| ह | 55 | 1.3% | |
| द | 53 | 1.3% | |
| د | 52 | 1.3% | |
| Other values (246) | 1552 | 37.6% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| 🇺 | 125 | 6.4% | |
| 🇳 | 105 | 5.4% | |
| 🇸 | 96 | 4.9% | |
| 🇮 | 84 | 4.3% | |
| 🇬 | 82 | 4.2% | |
| 🇪 | 74 | 3.8% | |
| 🇧 | 73 | 3.8% | |
| 🇦 | 68 | 3.5% | |
| 🌍 | 59 | 3.0% | |
| 🇨 | 55 | 2.8% | |
| 🇵 | 53 | 2.7% | |
| 🌎 | 42 | 2.2% | |
| ° | 41 | 2.1% | |
| 🇰 | 38 | 2.0% | |
| ✈ | 37 | 1.9% | |
| 🌏 | 29 | 1.5% | |
| ➡ | 29 | 1.5% | |
| 🇱 | 28 | 1.4% | |
| 💶 | 26 | 1.3% | |
| 💵 | 26 | 1.3% | |
| 💷 | 26 | 1.3% | |
| ❤ | 25 | 1.3% | |
| 🏆 | 24 | 1.2% | |
| ⠀ | 23 | 1.2% | |
| 🇷 | 22 | 1.1% | |
| Other values (196) | 656 | 33.7% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| | | 242 | 58.9% | |
| + | 75 | 18.2% | |
| = | 47 | 11.4% | |
| | | 18 | 4.4% | |
| ~ | 16 | 3.9% | |
| ↔ | 3 | 0.7% | |
| ∪ | 3 | 0.7% | |
| → | 1 | 0.2% | |
| ≠ | 1 | 0.2% | |
| ∀ | 1 | 0.2% | |
| ∞ | 1 | 0.2% | |
| ⅃ | 1 | 0.2% | |
| ↑ | 1 | 0.2% | |
| ↓ | 1 | 0.2% |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| ️ | 155 | 26.7% | |
| ् | 129 | 22.2% | |
| ु | 69 | 11.9% | |
| ं | 46 | 7.9% | |
| े | 40 | 6.9% | |
| ี | 28 | 4.8% | |
| ู | 28 | 4.8% | |
| ︎ | 14 | 2.4% | |
| ू | 11 | 1.9% | |
| ் | 9 | 1.6% | |
| ್ | 6 | 1.0% | |
| ृ | 6 | 1.0% | |
| ँ | 5 | 0.9% | |
| ಿ | 4 | 0.7% | |
| ै | 4 | 0.7% | |
| ్ | 4 | 0.7% | |
| ُ | 3 | 0.5% | |
| ୁ | 2 | 0.3% | |
| ೆ | 2 | 0.3% | |
| ़ | 2 | 0.3% | |
| ુ | 2 | 0.3% | |
| ି | 1 | 0.2% | |
| ٍ | 1 | 0.2% | |
| َ | 1 | 0.2% | |
| ా | 1 | 0.2% | |
| Other values (7) | 7 | 1.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 651 | 15.4% | |
| 1 | 590 | 13.9% | |
| 2 | 492 | 11.6% | |
| 7 | 453 | 10.7% | |
| 3 | 435 | 10.3% | |
| 9 | 360 | 8.5% | |
| 5 | 345 | 8.1% | |
| 4 | 342 | 8.1% | |
| 8 | 290 | 6.8% | |
| 6 | 282 | 6.6% | |
| ९ | 1 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 164 | 95.9% | |
| [ | 5 | 2.9% | |
| 「 | 1 | 0.6% | |
| 【 | 1 | 0.6% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 169 | 96.0% | |
| ] | 5 | 2.8% | |
| 」 | 1 | 0.6% | |
| 】 | 1 | 0.6% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| 🏾 | 12 | 35.3% | |
| 🏻 | 12 | 35.3% | |
| 🏽 | 3 | 8.8% | |
| 🏼 | 2 | 5.9% | |
| ¯ | 2 | 5.9% | |
| 🏿 | 2 | 5.9% | |
| ^ | 1 | 2.9% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| | 20 | 44.4% | |
| | 5 | 11.1% | |
| | 4 | 8.9% | |
| | 4 | 8.9% | |
| | 3 | 6.7% | |
| | 2 | 4.4% | |
| | 2 | 4.4% | |
| | 1 | 2.2% | |
| | 1 | 2.2% | |
| | 1 | 2.2% | |
| | 1 | 2.2% | |
| | 1 | 2.2% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 23 | 71.9% | |
| ” | 6 | 18.8% | |
| » | 3 | 9.4% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ा | 317 | 60.2% | |
| ि | 72 | 13.7% | |
| ी | 67 | 12.7% | |
| ो | 21 | 4.0% | |
| ು | 8 | 1.5% | |
| ಾ | 7 | 1.3% | |
| ா | 7 | 1.3% | |
| ி | 5 | 0.9% | |
| ಂ | 3 | 0.6% | |
| ு | 3 | 0.6% | |
| ା | 2 | 0.4% | |
| ೂ | 2 | 0.4% | |
| ः | 2 | 0.4% | |
| ા | 2 | 0.4% | |
| ே | 2 | 0.4% | |
| ు | 1 | 0.2% | |
| ೃ | 1 | 0.2% | |
| ೇ | 1 | 0.2% | |
| া | 1 | 0.2% | |
| ை | 1 | 0.2% | |
| ం | 1 | 0.2% | |
| ూ | 1 | 0.2% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 14 | 100.0% |
Most frequent Modifier Letter characters
| Value | Count | Frequency (%) | |
| ー | 4 | 50.0% | |
| ـ | 2 | 25.0% | |
| ʻ | 2 | 25.0% |
Most frequent Initial Punctuation characters
| Value | Count | Frequency (%) | |
| “ | 6 | 60.0% | |
| « | 3 | 30.0% | |
| ‘ | 1 | 10.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 22 | 100.0% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ½ | 1 | 50.0% | |
| ㊉ | 1 | 50.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 2 | 66.7% | |
| ₱ | 1 | 33.3% |
Most frequent Enclosing Mark characters
| Value | Count | Frequency (%) | |
| ⃣ | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 460076 | 84.7% | |
| Common | 76877 | 14.2% | |
| Devanagari | 2238 | 0.4% | |
| Arabic | 1190 | 0.2% | |
| Han | 990 | 0.2% | |
| Cyrillic | 607 | 0.1% | |
| Thai | 252 | < 0.1% | |
| Inherited | 198 | < 0.1% | |
| Greek | 94 | < 0.1% | |
| Kannada | 90 | < 0.1% | |
| Katakana | 86 | < 0.1% | |
| Tamil | 68 | < 0.1% | |
| Hangul | 42 | < 0.1% | |
| Telugu | 29 | < 0.1% | |
| Braille | 23 | < 0.1% | |
| Oriya | 16 | < 0.1% | |
| Hebrew | 15 | < 0.1% | |
| Canadian_Aboriginal | 13 | < 0.1% | |
| Gujarati | 12 | < 0.1% | |
| Cherokee | 11 | < 0.1% | |
| Bengali | 11 | < 0.1% | |
| Coptic | 6 | < 0.1% | |
| Gurmukhi | 6 | < 0.1% | |
| Hiragana | 4 | < 0.1% | |
| Yi | 2 | < 0.1% | |
| Other values (4) | 4 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 58747 | 12.8% | |
| n | 57651 | 12.5% | |
| i | 32311 | 7.0% | |
| e | 31222 | 6.8% | |
| o | 27020 | 5.9% | |
| r | 22724 | 4.9% | |
| d | 20122 | 4.4% | |
| l | 18703 | 4.1% | |
| t | 18635 | 4.1% | |
| s | 15199 | 3.3% | |
| h | 11213 | 2.4% | |
| u | 9799 | 2.1% | |
| g | 8788 | 1.9% | |
| C | 7755 | 1.7% | |
| A | 7535 | 1.6% | |
| S | 6578 | 1.4% | |
| I | 6257 | 1.4% | |
| m | 6089 | 1.3% | |
| w | 5726 | 1.2% | |
| N | 5523 | 1.2% | |
| c | 5438 | 1.2% | |
| b | 5360 | 1.2% | |
| y | 4679 | 1.0% | |
| U | 4613 | 1.0% | |
| B | 4541 | 1.0% | |
| Other values (86) | 57848 | 12.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 45505 | 59.2% | ||
| , | 19434 | 25.3% | |
| . | 1752 | 2.3% | |
| - | 851 | 1.1% | |
| / | 758 | 1.0% | |
| 0 | 651 | 0.8% | |
| 1 | 590 | 0.8% | |
| 2 | 492 | 0.6% | |
| 7 | 453 | 0.6% | |
| 3 | 435 | 0.6% | |
| 9 | 360 | 0.5% | |
| 5 | 345 | 0.4% | |
| 4 | 342 | 0.4% | |
| & | 297 | 0.4% | |
| 8 | 290 | 0.4% | |
| 6 | 282 | 0.4% | |
| # | 281 | 0.4% | |
| | | 242 | 0.3% | |
| : | 225 | 0.3% | |
| ' | 195 | 0.3% | |
| ) | 169 | 0.2% | |
| ( | 164 | 0.2% | |
| ! | 126 | 0.2% | |
| 🇺 | 125 | 0.2% | |
| @ | 105 | 0.1% | |
| Other values (369) | 2408 | 3.1% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 263 | 22.1% | |
| ل | 126 | 10.6% | |
| ر | 98 | 8.2% | |
| م | 83 | 7.0% | |
| ة | 73 | 6.1% | |
| ت | 72 | 6.1% | |
| د | 52 | 4.4% | |
| ب | 50 | 4.2% | |
| ي | 48 | 4.0% | |
| ن | 42 | 3.5% | |
| س | 35 | 2.9% | |
| ی | 35 | 2.9% | |
| ع | 33 | 2.8% | |
| و | 25 | 2.1% | |
| ح | 24 | 2.0% | |
| ک | 17 | 1.4% | |
| ہ | 16 | 1.3% | |
| پ | 13 | 1.1% | |
| ز | 12 | 1.0% | |
| إ | 9 | 0.8% | |
| ق | 9 | 0.8% | |
| ج | 8 | 0.7% | |
| ش | 8 | 0.7% | |
| ض | 6 | 0.5% | |
| أ | 6 | 0.5% | |
| Other values (10) | 27 | 2.3% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ️ | 155 | 78.3% | |
| | 20 | 10.1% | |
| ︎ | 14 | 7.1% | |
| ُ | 3 | 1.5% | |
| ⃣ | 3 | 1.5% | |
| ٍ | 1 | 0.5% | |
| َ | 1 | 0.5% | |
| ً | 1 | 0.5% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 京 | 94 | 9.5% | |
| 中 | 90 | 9.1% | |
| 国 | 89 | 9.0% | |
| 华 | 87 | 8.8% | |
| 人 | 87 | 8.8% | |
| 民 | 87 | 8.8% | |
| 共 | 87 | 8.8% | |
| 和 | 87 | 8.8% | |
| 北 | 85 | 8.6% | |
| 市 | 17 | 1.7% | |
| 新 | 15 | 1.5% | |
| 日 | 12 | 1.2% | |
| 本 | 12 | 1.2% | |
| 潟 | 11 | 1.1% | |
| 東 | 10 | 1.0% | |
| 区 | 10 | 1.0% | |
| 加 | 8 | 0.8% | |
| 都 | 6 | 0.6% | |
| 品 | 6 | 0.6% | |
| 川 | 6 | 0.6% | |
| 坡 | 4 | 0.4% | |
| 上 | 4 | 0.4% | |
| 海 | 4 | 0.4% | |
| 溫 | 4 | 0.4% | |
| 哥 | 4 | 0.4% | |
| Other values (31) | 64 | 6.5% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| с | 64 | 10.5% | |
| и | 53 | 8.7% | |
| а | 49 | 8.1% | |
| о | 47 | 7.7% | |
| е | 32 | 5.3% | |
| я | 31 | 5.1% | |
| р | 29 | 4.8% | |
| н | 27 | 4.4% | |
| к | 24 | 4.0% | |
| Р | 23 | 3.8% | |
| Л | 17 | 2.8% | |
| С | 17 | 2.8% | |
| в | 16 | 2.6% | |
| т | 14 | 2.3% | |
| б | 14 | 2.3% | |
| М | 13 | 2.1% | |
| А | 13 | 2.1% | |
| г | 10 | 1.6% | |
| К | 9 | 1.5% | |
| л | 9 | 1.5% | |
| Е | 9 | 1.5% | |
| й | 9 | 1.5% | |
| у | 8 | 1.3% | |
| м | 7 | 1.2% | |
| П | 7 | 1.2% | |
| Other values (24) | 56 | 9.2% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 대 | 2 | 4.8% | |
| 한 | 2 | 4.8% | |
| 민 | 2 | 4.8% | |
| 국 | 2 | 4.8% | |
| 서 | 2 | 4.8% | |
| 울 | 2 | 4.8% | |
| 우 | 1 | 2.4% | |
| 리 | 1 | 2.4% | |
| 함 | 1 | 2.4% | |
| 께 | 1 | 2.4% | |
| 라 | 1 | 2.4% | |
| 면 | 1 | 2.4% | |
| 사 | 1 | 2.4% | |
| 막 | 1 | 2.4% | |
| 도 | 1 | 2.4% | |
| 바 | 1 | 2.4% | |
| 다 | 1 | 2.4% | |
| 가 | 1 | 2.4% | |
| 돼 | 1 | 2.4% | |
| 방 | 1 | 2.4% | |
| 탄 | 1 | 2.4% | |
| 육 | 1 | 2.4% | |
| 성 | 1 | 2.4% | |
| 재 | 1 | 2.4% | |
| 김 | 1 | 2.4% | |
| Other values (11) | 11 | 26.2% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| Λ | 16 | 17.0% | |
| Ε | 11 | 11.7% | |
| Α | 9 | 9.6% | |
| α | 9 | 9.6% | |
| Σ | 8 | 8.5% | |
| λ | 7 | 7.4% | |
| ά | 4 | 4.3% | |
| ν | 4 | 4.3% | |
| ς | 3 | 3.2% | |
| τ | 3 | 3.2% | |
| ο | 2 | 2.1% | |
| ι | 2 | 2.1% | |
| ῶ | 2 | 2.1% | |
| υ | 2 | 2.1% | |
| Κ | 1 | 1.1% | |
| ζ | 1 | 1.1% | |
| η | 1 | 1.1% | |
| κ | 1 | 1.1% | |
| ή | 1 | 1.1% | |
| β | 1 | 1.1% | |
| σ | 1 | 1.1% | |
| ε | 1 | 1.1% | |
| ί | 1 | 1.1% | |
| ὐ | 1 | 1.1% | |
| ρ | 1 | 1.1% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 317 | 14.2% | |
| र | 275 | 12.3% | |
| त | 194 | 8.7% | |
| भ | 159 | 7.1% | |
| ् | 129 | 5.8% | |
| ल | 100 | 4.5% | |
| ि | 72 | 3.2% | |
| ु | 69 | 3.1% | |
| ी | 67 | 3.0% | |
| म | 64 | 2.9% | |
| क | 61 | 2.7% | |
| न | 59 | 2.6% | |
| ह | 55 | 2.5% | |
| द | 53 | 2.4% | |
| व | 52 | 2.3% | |
| ं | 46 | 2.1% | |
| ग | 43 | 1.9% | |
| स | 41 | 1.8% | |
| े | 40 | 1.8% | |
| ब | 40 | 1.8% | |
| ई | 34 | 1.5% | |
| ट | 27 | 1.2% | |
| प | 22 | 1.0% | |
| ो | 21 | 0.9% | |
| य | 21 | 0.9% | |
| Other values (26) | 177 | 7.9% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꮥ | 4 | 36.4% | |
| Ꭵ | 4 | 36.4% | |
| Ꮻ | 2 | 18.2% | |
| Ꮢ | 1 | 9.1% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| Ծ | 1 | 100.0% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| Ⲉ | 2 | 33.3% | |
| Ⲁ | 2 | 33.3% | |
| Ϯ | 1 | 16.7% | |
| Ⲧ | 1 | 16.7% |
Most frequent Canadian_Aboriginal characters
| Value | Count | Frequency (%) | |
| ᗩ | 3 | 23.1% | |
| ᕱ | 2 | 15.4% | |
| ᗪ | 2 | 15.4% | |
| ᑎ | 2 | 15.4% | |
| ᗰ | 1 | 7.7% | |
| ᕼ | 1 | 7.7% | |
| ᑌ | 1 | 7.7% | |
| ᗷ | 1 | 7.7% |
Most frequent Yi characters
| Value | Count | Frequency (%) | |
| ꊰ | 1 | 50.0% | |
| ꒝ | 1 | 50.0% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ಕ | 12 | 13.3% | |
| ನ | 12 | 13.3% | |
| ರ | 10 | 11.1% | |
| ು | 8 | 8.9% | |
| ಾ | 7 | 7.8% | |
| ಡ | 6 | 6.7% | |
| ್ | 6 | 6.7% | |
| ಗ | 6 | 6.7% | |
| ಿ | 4 | 4.4% | |
| ಂ | 3 | 3.3% | |
| ಟ | 3 | 3.3% | |
| ಬ | 2 | 2.2% | |
| ೆ | 2 | 2.2% | |
| ಳ | 2 | 2.2% | |
| ೂ | 2 | 2.2% | |
| ಶ | 1 | 1.1% | |
| ೃ | 1 | 1.1% | |
| ೇ | 1 | 1.1% | |
| ಭ | 1 | 1.1% | |
| ತ | 1 | 1.1% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| י | 3 | 20.0% | |
| ש | 3 | 20.0% | |
| ר | 3 | 20.0% | |
| א | 3 | 20.0% | |
| ל | 3 | 20.0% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| イ | 18 | 20.9% | |
| ス | 12 | 14.0% | |
| ン | 11 | 12.8% | |
| ド | 9 | 10.5% | |
| ギ | 9 | 10.5% | |
| リ | 9 | 10.5% | |
| バ | 4 | 4.7% | |
| ユ | 2 | 2.3% | |
| ニ | 2 | 2.3% | |
| ツ | 2 | 2.3% | |
| ク | 1 | 1.2% | |
| カ | 1 | 1.2% | |
| ナ | 1 | 1.2% | |
| ダ | 1 | 1.2% | |
| ル | 1 | 1.2% | |
| ッ | 1 | 1.2% | |
| フ | 1 | 1.2% | |
| ラ | 1 | 1.2% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 9 | 13.2% | |
| ம | 8 | 11.8% | |
| த | 7 | 10.3% | |
| ா | 7 | 10.3% | |
| ி | 5 | 7.4% | |
| ழ | 4 | 5.9% | |
| ட | 4 | 5.9% | |
| ன | 3 | 4.4% | |
| ு | 3 | 4.4% | |
| ர | 3 | 4.4% | |
| க | 2 | 2.9% | |
| ய | 2 | 2.9% | |
| ஊ | 2 | 2.9% | |
| ே | 2 | 2.9% | |
| ந | 1 | 1.5% | |
| ச | 1 | 1.5% | |
| ங | 1 | 1.5% | |
| ை | 1 | 1.5% | |
| ப | 1 | 1.5% | |
| உ | 1 | 1.5% | |
| ல | 1 | 1.5% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ର | 3 | 18.8% | |
| ହ | 3 | 18.8% | |
| ୁ | 2 | 12.5% | |
| ା | 2 | 12.5% | |
| ଦ | 2 | 12.5% | |
| ଇ | 1 | 6.2% | |
| ବ | 1 | 6.2% | |
| ି | 1 | 6.2% | |
| ଆ | 1 | 6.2% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ม | 28 | 11.1% | |
| ี | 28 | 11.1% | |
| ู | 28 | 11.1% | |
| ต | 28 | 11.1% | |
| ก | 14 | 5.6% | |
| ำ | 14 | 5.6% | |
| แ | 14 | 5.6% | |
| พ | 14 | 5.6% | |
| ง | 14 | 5.6% | |
| ห | 14 | 5.6% | |
| ป | 14 | 5.6% | |
| ร | 14 | 5.6% | |
| ะ | 14 | 5.6% | |
| า | 14 | 5.6% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 4 | 13.8% | |
| ర | 3 | 10.3% | |
| మ | 3 | 10.3% | |
| భ | 2 | 6.9% | |
| గ | 2 | 6.9% | |
| ద | 2 | 6.9% | |
| ా | 1 | 3.4% | |
| య | 1 | 3.4% | |
| న | 1 | 3.4% | |
| ు | 1 | 3.4% | |
| ఆ | 1 | 3.4% | |
| ం | 1 | 3.4% | |
| ధ | 1 | 3.4% | |
| ప | 1 | 3.4% | |
| ే | 1 | 3.4% | |
| శ | 1 | 3.4% | |
| ూ | 1 | 3.4% | |
| ి | 1 | 3.4% | |
| ీ | 1 | 3.4% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ગ | 2 | 16.7% | |
| ુ | 2 | 16.7% | |
| જ | 2 | 16.7% | |
| ર | 2 | 16.7% | |
| ા | 2 | 16.7% | |
| ત | 2 | 16.7% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| র | 2 | 18.2% | |
| ভ | 1 | 9.1% | |
| া | 1 | 9.1% | |
| ত | 1 | 9.1% | |
| ব | 1 | 9.1% | |
| ্ | 1 | 9.1% | |
| ষ | 1 | 9.1% | |
| অ | 1 | 9.1% | |
| স | 1 | 9.1% | |
| ম | 1 | 9.1% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ਰ | 2 | 33.3% | |
| ਅ | 1 | 16.7% | |
| ੰ | 1 | 16.7% | |
| ਬ | 1 | 16.7% | |
| ਸ | 1 | 16.7% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 23 | 100.0% |
Most frequent Syriac characters
| Value | Count | Frequency (%) | |
| ݁ | 1 | 100.0% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| Ⴑ | 1 | 100.0% |
Most frequent Tangut characters
| Value | Count | Frequency (%) | |
| 𗀠 | 1 | 100.0% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| り | 1 | 25.0% | |
| し | 1 | 25.0% | |
| れ | 1 | 25.0% | |
| さ | 1 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 533921 | 98.3% | |
| Devanagari | 2239 | 0.4% | |
| Arabic | 1204 | 0.2% | |
| Enclosed Alphanum Sup | 1020 | 0.2% | |
| CJK | 990 | 0.2% | |
| None | 744 | 0.1% | |
| Latin 1 Sup | 733 | 0.1% | |
| Cyrillic | 604 | 0.1% | |
| Thai | 252 | < 0.1% | |
| VS | 169 | < 0.1% | |
| Dingbats | 150 | < 0.1% | |
| Math Alphanum | 120 | < 0.1% | |
| Punctuation | 105 | < 0.1% | |
| Kannada | 90 | < 0.1% | |
| Katakana | 90 | < 0.1% | |
| Misc Symbols | 85 | < 0.1% | |
| Tamil | 68 | < 0.1% | |
| Latin Ext A | 49 | < 0.1% | |
| Hangul | 42 | < 0.1% | |
| Telugu | 29 | < 0.1% | |
| Tags | 24 | < 0.1% | |
| Braille | 23 | < 0.1% | |
| Emoticons | 21 | < 0.1% | |
| IPA Ext | 18 | < 0.1% | |
| Oriya | 16 | < 0.1% | |
| Other values (31) | 154 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 58747 | 11.0% | |
| n | 57651 | 10.8% | |
| 45505 | 8.5% | ||
| i | 32311 | 6.1% | |
| e | 31222 | 5.8% | |
| o | 27020 | 5.1% | |
| r | 22724 | 4.3% | |
| d | 20122 | 3.8% | |
| , | 19434 | 3.6% | |
| l | 18703 | 3.5% | |
| t | 18635 | 3.5% | |
| s | 15199 | 2.8% | |
| h | 11213 | 2.1% | |
| u | 9799 | 1.8% | |
| g | 8788 | 1.6% | |
| C | 7755 | 1.5% | |
| A | 7535 | 1.4% | |
| S | 6578 | 1.2% | |
| I | 6257 | 1.2% | |
| m | 6089 | 1.1% | |
| w | 5726 | 1.1% | |
| N | 5523 | 1.0% | |
| c | 5438 | 1.0% | |
| b | 5360 | 1.0% | |
| y | 4679 | 0.9% | |
| Other values (66) | 75908 | 14.2% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| ü | 188 | 25.6% | |
| ã | 101 | 13.8% | |
| Ü | 80 | 10.9% | |
| é | 80 | 10.9% | |
| á | 64 | 8.7% | |
| ° | 41 | 5.6% | |
| ó | 30 | 4.1% | |
| ë | 25 | 3.4% | |
| ñ | 22 | 3.0% | |
| Ö | 11 | 1.5% | |
| ú | 10 | 1.4% | |
| ö | 9 | 1.2% | |
| í | 8 | 1.1% | |
| ä | 7 | 1.0% | |
| ï | 7 | 1.0% | |
| è | 6 | 0.8% | |
| à | 5 | 0.7% | |
| â | 4 | 0.5% | |
| ø | 4 | 0.5% | |
| É | 3 | 0.4% | |
| 3 | 0.4% | ||
| » | 3 | 0.4% | |
| « | 3 | 0.4% | |
| Ú | 3 | 0.4% | |
| ¯ | 2 | 0.3% | |
| Other values (11) | 14 | 1.9% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 263 | 21.8% | |
| ل | 126 | 10.5% | |
| ر | 98 | 8.1% | |
| م | 83 | 6.9% | |
| ة | 73 | 6.1% | |
| ت | 72 | 6.0% | |
| د | 52 | 4.3% | |
| ب | 50 | 4.2% | |
| ي | 48 | 4.0% | |
| ن | 42 | 3.5% | |
| س | 35 | 2.9% | |
| ی | 35 | 2.9% | |
| ع | 33 | 2.7% | |
| و | 25 | 2.1% | |
| ح | 24 | 2.0% | |
| ک | 17 | 1.4% | |
| ہ | 16 | 1.3% | |
| پ | 13 | 1.1% | |
| ز | 12 | 1.0% | |
| إ | 9 | 0.7% | |
| ق | 9 | 0.7% | |
| ج | 8 | 0.7% | |
| ش | 8 | 0.7% | |
| ض | 6 | 0.5% | |
| أ | 6 | 0.5% | |
| Other values (16) | 41 | 3.4% |
Most frequent Enclosed Alphanum Sup characters
| Value | Count | Frequency (%) | |
| 🇺 | 125 | 12.3% | |
| 🇳 | 105 | 10.3% | |
| 🇸 | 96 | 9.4% | |
| 🇮 | 84 | 8.2% | |
| 🇬 | 82 | 8.0% | |
| 🇪 | 74 | 7.3% | |
| 🇧 | 73 | 7.2% | |
| 🇦 | 68 | 6.7% | |
| 🇨 | 55 | 5.4% | |
| 🇵 | 53 | 5.2% | |
| 🇰 | 38 | 3.7% | |
| 🇱 | 28 | 2.7% | |
| 🇷 | 22 | 2.2% | |
| 🇹 | 20 | 2.0% | |
| 🇲 | 19 | 1.9% | |
| 🇯 | 19 | 1.9% | |
| 🇩 | 14 | 1.4% | |
| 🇭 | 10 | 1.0% | |
| 🇴 | 10 | 1.0% | |
| 🇫 | 9 | 0.9% | |
| 🇽 | 4 | 0.4% | |
| 🇼 | 4 | 0.4% | |
| 🇾 | 3 | 0.3% | |
| 🇿 | 3 | 0.3% | |
| 🇶 | 1 | 0.1% |
Most frequent Dingbats characters
| Value | Count | Frequency (%) | |
| ✈ | 37 | 24.7% | |
| ➡ | 29 | 19.3% | |
| ❤ | 25 | 16.7% | |
| ✊ | 8 | 5.3% | |
| ✨ | 7 | 4.7% | |
| ❂ | 6 | 4.0% | |
| ✫ | 6 | 4.0% | |
| ❁ | 6 | 4.0% | |
| ❄ | 5 | 3.3% | |
| ✔ | 2 | 1.3% | |
| ✡ | 2 | 1.3% | |
| ❣ | 2 | 1.3% | |
| ➰ | 2 | 1.3% | |
| ❌ | 2 | 1.3% | |
| ✳ | 2 | 1.3% | |
| ✴ | 2 | 1.3% | |
| ❇ | 2 | 1.3% | |
| ➕ | 2 | 1.3% | |
| ✋ | 1 | 0.7% | |
| ✝ | 1 | 0.7% | |
| ✌ | 1 | 0.7% |
Most frequent VS characters
| Value | Count | Frequency (%) | |
| ️ | 155 | 91.7% | |
| ︎ | 14 | 8.3% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| 🌍 | 59 | 7.9% | |
| 🌎 | 42 | 5.6% | |
| 🌏 | 29 | 3.9% | |
| 💶 | 26 | 3.5% | |
| 💵 | 26 | 3.5% | |
| 💷 | 26 | 3.5% | |
| 🏆 | 24 | 3.2% | |
| | | 18 | 2.4% | |
| 🌐 | 17 | 2.3% | |
| 💙 | 16 | 2.2% | |
| Λ | 16 | 2.2% | |
| 📍 | 15 | 2.0% | |
| 🌊 | 13 | 1.7% | |
| 🏾 | 12 | 1.6% | |
| 🏻 | 12 | 1.6% | |
| Ε | 11 | 1.5% | |
| 💜 | 11 | 1.5% | |
| 👱 | 10 | 1.3% | |
| 🦞 | 9 | 1.2% | |
| 🦁 | 9 | 1.2% | |
| Α | 9 | 1.2% | |
| α | 9 | 1.2% | |
| 👩 | 8 | 1.1% | |
| 💻 | 8 | 1.1% | |
| 🏡 | 8 | 1.1% | |
| Other values (142) | 301 | 40.5% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| • | 32 | 30.5% | |
| ’ | 23 | 21.9% | |
| | 20 | 19.0% | |
| “ | 6 | 5.7% | |
| ” | 6 | 5.7% | |
| ′ | 4 | 3.8% | |
| — | 4 | 3.8% | |
| – | 3 | 2.9% | |
| † | 3 | 2.9% | |
| ″ | 2 | 1.9% | |
| | 1 | 1.0% | |
| ‘ | 1 | 1.0% |
Most frequent Misc Symbols characters
| Value | Count | Frequency (%) | |
| ⛵ | 10 | 11.8% | |
| ♀ | 10 | 11.8% | |
| ♻ | 7 | 8.2% | |
| ♫ | 6 | 7.1% | |
| ★ | 6 | 7.1% | |
| ☆ | 5 | 5.9% | |
| ☀ | 5 | 5.9% | |
| ♥ | 4 | 4.7% | |
| ♡ | 4 | 4.7% | |
| ⚓ | 3 | 3.5% | |
| ⚢ | 3 | 3.5% | |
| ☁ | 3 | 3.5% | |
| ☮ | 3 | 3.5% | |
| ☑ | 2 | 2.4% | |
| ⚜ | 2 | 2.4% | |
| ⛅ | 2 | 2.4% | |
| ☃ | 1 | 1.2% | |
| ♅ | 1 | 1.2% | |
| ♂ | 1 | 1.2% | |
| ⚕ | 1 | 1.2% | |
| ⚖ | 1 | 1.2% | |
| ♔ | 1 | 1.2% | |
| ⚾ | 1 | 1.2% | |
| ♑ | 1 | 1.2% | |
| ☪ | 1 | 1.2% |
Most frequent Latin Ext A characters
| Value | Count | Frequency (%) | |
| İ | 13 | 26.5% | |
| č | 13 | 26.5% | |
| Č | 8 | 16.3% | |
| ā | 5 | 10.2% | |
| ş | 3 | 6.1% | |
| ł | 1 | 2.0% | |
| ğ | 1 | 2.0% | |
| œ | 1 | 2.0% | |
| Ā | 1 | 2.0% | |
| Ŧ | 1 | 2.0% | |
| ī | 1 | 2.0% | |
| Ş | 1 | 2.0% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 京 | 94 | 9.5% | |
| 中 | 90 | 9.1% | |
| 国 | 89 | 9.0% | |
| 华 | 87 | 8.8% | |
| 人 | 87 | 8.8% | |
| 民 | 87 | 8.8% | |
| 共 | 87 | 8.8% | |
| 和 | 87 | 8.8% | |
| 北 | 85 | 8.6% | |
| 市 | 17 | 1.7% | |
| 新 | 15 | 1.5% | |
| 日 | 12 | 1.2% | |
| 本 | 12 | 1.2% | |
| 潟 | 11 | 1.1% | |
| 東 | 10 | 1.0% | |
| 区 | 10 | 1.0% | |
| 加 | 8 | 0.8% | |
| 都 | 6 | 0.6% | |
| 品 | 6 | 0.6% | |
| 川 | 6 | 0.6% | |
| 坡 | 4 | 0.4% | |
| 上 | 4 | 0.4% | |
| 海 | 4 | 0.4% | |
| 溫 | 4 | 0.4% | |
| 哥 | 4 | 0.4% | |
| Other values (31) | 64 | 6.5% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| с | 64 | 10.6% | |
| и | 53 | 8.8% | |
| а | 49 | 8.1% | |
| о | 47 | 7.8% | |
| е | 32 | 5.3% | |
| я | 31 | 5.1% | |
| р | 29 | 4.8% | |
| н | 27 | 4.5% | |
| к | 24 | 4.0% | |
| Р | 23 | 3.8% | |
| Л | 17 | 2.8% | |
| С | 17 | 2.8% | |
| в | 16 | 2.6% | |
| т | 14 | 2.3% | |
| б | 14 | 2.3% | |
| М | 13 | 2.2% | |
| А | 13 | 2.2% | |
| г | 10 | 1.7% | |
| К | 9 | 1.5% | |
| л | 9 | 1.5% | |
| Е | 9 | 1.5% | |
| й | 9 | 1.5% | |
| у | 8 | 1.3% | |
| м | 7 | 1.2% | |
| П | 7 | 1.2% | |
| Other values (21) | 53 | 8.8% |
Most frequent Math Alphanum characters
| Value | Count | Frequency (%) | |
| 𝕖 | 6 | 5.0% | |
| 𝗧 | 4 | 3.3% | |
| 𝕒 | 4 | 3.3% | |
| 𝖊 | 3 | 2.5% | |
| 𝖎 | 3 | 2.5% | |
| 𝖓 | 3 | 2.5% | |
| 𝖔 | 3 | 2.5% | |
| 𝑒 | 3 | 2.5% | |
| 𝗮 | 3 | 2.5% | |
| 𝓮 | 2 | 1.7% | |
| 𝒆 | 2 | 1.7% | |
| 𝒅 | 2 | 1.7% | |
| 𝒓 | 2 | 1.7% | |
| 𝖆 | 2 | 1.7% | |
| 𝖙 | 2 | 1.7% | |
| 𝖗 | 2 | 1.7% | |
| 𝖉 | 2 | 1.7% | |
| 𝗘 | 2 | 1.7% | |
| 𝗦 | 2 | 1.7% | |
| 𝚃 | 2 | 1.7% | |
| 𝚎 | 2 | 1.7% | |
| 𝚡 | 2 | 1.7% | |
| 𝚊 | 2 | 1.7% | |
| 𝚜 | 2 | 1.7% | |
| 𝓇 | 2 | 1.7% | |
| Other values (46) | 56 | 46.7% |
Most frequent Emoticons characters
| Value | Count | Frequency (%) | |
| 😉 | 4 | 19.0% | |
| 😂 | 4 | 19.0% | |
| 😎 | 3 | 14.3% | |
| 😊 | 2 | 9.5% | |
| 🙄 | 2 | 9.5% | |
| 🙏 | 1 | 4.8% | |
| 🙌 | 1 | 4.8% | |
| 😌 | 1 | 4.8% | |
| 😝 | 1 | 4.8% | |
| 😢 | 1 | 4.8% | |
| 😷 | 1 | 4.8% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 대 | 2 | 4.8% | |
| 한 | 2 | 4.8% | |
| 민 | 2 | 4.8% | |
| 국 | 2 | 4.8% | |
| 서 | 2 | 4.8% | |
| 울 | 2 | 4.8% | |
| 우 | 1 | 2.4% | |
| 리 | 1 | 2.4% | |
| 함 | 1 | 2.4% | |
| 께 | 1 | 2.4% | |
| 라 | 1 | 2.4% | |
| 면 | 1 | 2.4% | |
| 사 | 1 | 2.4% | |
| 막 | 1 | 2.4% | |
| 도 | 1 | 2.4% | |
| 바 | 1 | 2.4% | |
| 다 | 1 | 2.4% | |
| 가 | 1 | 2.4% | |
| 돼 | 1 | 2.4% | |
| 방 | 1 | 2.4% | |
| 탄 | 1 | 2.4% | |
| 육 | 1 | 2.4% | |
| 성 | 1 | 2.4% | |
| 재 | 1 | 2.4% | |
| 김 | 1 | 2.4% | |
| Other values (11) | 11 | 26.2% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 317 | 14.2% | |
| र | 275 | 12.3% | |
| त | 194 | 8.7% | |
| भ | 159 | 7.1% | |
| ् | 129 | 5.8% | |
| ल | 100 | 4.5% | |
| ि | 72 | 3.2% | |
| ु | 69 | 3.1% | |
| ी | 67 | 3.0% | |
| म | 64 | 2.9% | |
| क | 61 | 2.7% | |
| न | 59 | 2.6% | |
| ह | 55 | 2.5% | |
| द | 53 | 2.4% | |
| व | 52 | 2.3% | |
| ं | 46 | 2.1% | |
| ग | 43 | 1.9% | |
| स | 41 | 1.8% | |
| े | 40 | 1.8% | |
| ब | 40 | 1.8% | |
| ई | 34 | 1.5% | |
| ट | 27 | 1.2% | |
| प | 22 | 1.0% | |
| ो | 21 | 0.9% | |
| य | 21 | 0.9% | |
| Other values (27) | 178 | 7.9% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꮥ | 4 | 36.4% | |
| Ꭵ | 4 | 36.4% | |
| Ꮻ | 2 | 18.2% | |
| Ꮢ | 1 | 9.1% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| Ծ | 1 | 100.0% |
Most frequent UCAS characters
| Value | Count | Frequency (%) | |
| ᗩ | 3 | 23.1% | |
| ᕱ | 2 | 15.4% | |
| ᗪ | 2 | 15.4% | |
| ᑎ | 2 | 15.4% | |
| ᗰ | 1 | 7.7% | |
| ᕼ | 1 | 7.7% | |
| ᑌ | 1 | 7.7% | |
| ᗷ | 1 | 7.7% |
Most frequent Yi Syllables characters
| Value | Count | Frequency (%) | |
| ꊰ | 1 | 100.0% |
Most frequent Yi Radicals characters
| Value | Count | Frequency (%) | |
| ꒝ | 1 | 100.0% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ಕ | 12 | 13.3% | |
| ನ | 12 | 13.3% | |
| ರ | 10 | 11.1% | |
| ು | 8 | 8.9% | |
| ಾ | 7 | 7.8% | |
| ಡ | 6 | 6.7% | |
| ್ | 6 | 6.7% | |
| ಗ | 6 | 6.7% | |
| ಿ | 4 | 4.4% | |
| ಂ | 3 | 3.3% | |
| ಟ | 3 | 3.3% | |
| ಬ | 2 | 2.2% | |
| ೆ | 2 | 2.2% | |
| ಳ | 2 | 2.2% | |
| ೂ | 2 | 2.2% | |
| ಶ | 1 | 1.1% | |
| ೃ | 1 | 1.1% | |
| ೇ | 1 | 1.1% | |
| ಭ | 1 | 1.1% | |
| ತ | 1 | 1.1% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| י | 3 | 20.0% | |
| ש | 3 | 20.0% | |
| ר | 3 | 20.0% | |
| א | 3 | 20.0% | |
| ל | 3 | 20.0% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| イ | 18 | 20.0% | |
| ス | 12 | 13.3% | |
| ン | 11 | 12.2% | |
| ド | 9 | 10.0% | |
| ギ | 9 | 10.0% | |
| リ | 9 | 10.0% | |
| バ | 4 | 4.4% | |
| ー | 4 | 4.4% | |
| ユ | 2 | 2.2% | |
| ニ | 2 | 2.2% | |
| ツ | 2 | 2.2% | |
| ク | 1 | 1.1% | |
| カ | 1 | 1.1% | |
| ナ | 1 | 1.1% | |
| ダ | 1 | 1.1% | |
| ル | 1 | 1.1% | |
| ッ | 1 | 1.1% | |
| フ | 1 | 1.1% | |
| ラ | 1 | 1.1% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 9 | 13.2% | |
| ம | 8 | 11.8% | |
| த | 7 | 10.3% | |
| ா | 7 | 10.3% | |
| ி | 5 | 7.4% | |
| ழ | 4 | 5.9% | |
| ட | 4 | 5.9% | |
| ன | 3 | 4.4% | |
| ு | 3 | 4.4% | |
| ர | 3 | 4.4% | |
| க | 2 | 2.9% | |
| ய | 2 | 2.9% | |
| ஊ | 2 | 2.9% | |
| ே | 2 | 2.9% | |
| ந | 1 | 1.5% | |
| ச | 1 | 1.5% | |
| ங | 1 | 1.5% | |
| ை | 1 | 1.5% | |
| ப | 1 | 1.5% | |
| உ | 1 | 1.5% | |
| ல | 1 | 1.5% |
Most frequent Arrows characters
| Value | Count | Frequency (%) | |
| ↘ | 3 | 25.0% | |
| ↔ | 3 | 25.0% | |
| ↗ | 2 | 16.7% | |
| → | 1 | 8.3% | |
| ↑ | 1 | 8.3% | |
| ↓ | 1 | 8.3% | |
| ↷ | 1 | 8.3% |
Most frequent Misc Technical characters
| Value | Count | Frequency (%) | |
| ⏚ | 1 | 100.0% |
Most frequent Math Operators characters
| Value | Count | Frequency (%) | |
| ∪ | 3 | 50.0% | |
| ≠ | 1 | 16.7% | |
| ∀ | 1 | 16.7% | |
| ∞ | 1 | 16.7% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ର | 3 | 18.8% | |
| ହ | 3 | 18.8% | |
| ୁ | 2 | 12.5% | |
| ା | 2 | 12.5% | |
| ଦ | 2 | 12.5% | |
| ଇ | 1 | 6.2% | |
| ବ | 1 | 6.2% | |
| ି | 1 | 6.2% | |
| ଆ | 1 | 6.2% |
Most frequent Phonetic Ext characters
| Value | Count | Frequency (%) | |
| ᴀ | 4 | 30.8% | |
| ᴍ | 3 | 23.1% | |
| ᴉ | 2 | 15.4% | |
| ᴛ | 2 | 15.4% | |
| ᴚ | 1 | 7.7% | |
| ᴎ | 1 | 7.7% |
Most frequent Latin Ext B characters
| Value | Count | Frequency (%) | |
| ƚ | 2 | 66.7% | |
| ǝ | 1 | 33.3% |
Most frequent IPA Ext characters
| Value | Count | Frequency (%) | |
| ʀ | 4 | 22.2% | |
| ɘ | 3 | 16.7% | |
| ɒ | 3 | 16.7% | |
| ɪ | 2 | 11.1% | |
| ɿ | 2 | 11.1% | |
| ɥ | 1 | 5.6% | |
| ʇ | 1 | 5.6% | |
| ɔ | 1 | 5.6% | |
| ʞ | 1 | 5.6% |
Most frequent Cyrillic Sup characters
| Value | Count | Frequency (%) | |
| Ԁ | 1 | 100.0% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ม | 28 | 11.1% | |
| ี | 28 | 11.1% | |
| ู | 28 | 11.1% | |
| ต | 28 | 11.1% | |
| ก | 14 | 5.6% | |
| ำ | 14 | 5.6% | |
| แ | 14 | 5.6% | |
| พ | 14 | 5.6% | |
| ง | 14 | 5.6% | |
| ห | 14 | 5.6% | |
| ป | 14 | 5.6% | |
| ร | 14 | 5.6% | |
| ะ | 14 | 5.6% | |
| า | 14 | 5.6% |
Most frequent Box Drawing characters
| Value | Count | Frequency (%) | |
| └ | 2 | 50.0% | |
| ║ | 2 | 50.0% |
Most frequent Latin Ext D characters
| Value | Count | Frequency (%) | |
| ꜱ | 2 | 100.0% |
Most frequent Tags characters
| Value | Count | Frequency (%) | |
| | 5 | 20.8% | |
| | 4 | 16.7% | |
| | 4 | 16.7% | |
| | 3 | 12.5% | |
| | 2 | 8.3% | |
| | 2 | 8.3% | |
| | 1 | 4.2% | |
| | 1 | 4.2% | |
| | 1 | 4.2% | |
| | 1 | 4.2% |
Most frequent Letterlike Symbols characters
| Value | Count | Frequency (%) | |
| ℕ | 3 | 27.3% | |
| ℜ | 3 | 27.3% | |
| ℓ | 2 | 18.2% | |
| ℂ | 2 | 18.2% | |
| ⅃ | 1 | 9.1% |
Most frequent Currency Symbols characters
| Value | Count | Frequency (%) | |
| ₱ | 1 | 100.0% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 4 | 13.8% | |
| ర | 3 | 10.3% | |
| మ | 3 | 10.3% | |
| భ | 2 | 6.9% | |
| గ | 2 | 6.9% | |
| ద | 2 | 6.9% | |
| ా | 1 | 3.4% | |
| య | 1 | 3.4% | |
| న | 1 | 3.4% | |
| ు | 1 | 3.4% | |
| ఆ | 1 | 3.4% | |
| ం | 1 | 3.4% | |
| ధ | 1 | 3.4% | |
| ప | 1 | 3.4% | |
| ే | 1 | 3.4% | |
| శ | 1 | 3.4% | |
| ూ | 1 | 3.4% | |
| ి | 1 | 3.4% | |
| ీ | 1 | 3.4% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| Ⲉ | 2 | 40.0% | |
| Ⲁ | 2 | 40.0% | |
| Ⲧ | 1 | 20.0% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ગ | 2 | 16.7% | |
| ુ | 2 | 16.7% | |
| જ | 2 | 16.7% | |
| ર | 2 | 16.7% | |
| ા | 2 | 16.7% | |
| ત | 2 | 16.7% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| র | 2 | 18.2% | |
| ভ | 1 | 9.1% | |
| া | 1 | 9.1% | |
| ত | 1 | 9.1% | |
| ব | 1 | 9.1% | |
| ্ | 1 | 9.1% | |
| ষ | 1 | 9.1% | |
| অ | 1 | 9.1% | |
| স | 1 | 9.1% | |
| ম | 1 | 9.1% |
Most frequent Greek Ext characters
| Value | Count | Frequency (%) | |
| ῶ | 2 | 66.7% | |
| ὐ | 1 | 33.3% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ਰ | 2 | 33.3% | |
| ਅ | 1 | 16.7% | |
| ੰ | 1 | 16.7% | |
| ਬ | 1 | 16.7% | |
| ਸ | 1 | 16.7% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 23 | 100.0% |
Most frequent Syriac characters
| Value | Count | Frequency (%) | |
| ݁ | 1 | 100.0% |
Most frequent Small Forms characters
| Value | Count | Frequency (%) | |
| ﹕ | 1 | 100.0% |
Most frequent Playing Cards characters
| Value | Count | Frequency (%) | |
| 🃏 | 1 | 100.0% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| Ⴑ | 1 | 100.0% |
Most frequent Latin Ext Additional characters
| Value | Count | Frequency (%) | |
| ṃ | 2 | 40.0% | |
| ṛ | 2 | 40.0% | |
| ệ | 1 | 20.0% |
Most frequent Modifier Letters characters
| Value | Count | Frequency (%) | |
| ʻ | 2 | 100.0% |
Most frequent Cyrillic Ext B characters
| Value | Count | Frequency (%) | |
| ꙅ | 1 | 50.0% | |
| Ꙅ | 1 | 50.0% |
Most frequent Geometric Shapes characters
| Value | Count | Frequency (%) | |
| ▫ | 3 | 100.0% |
Most frequent Tangut characters
| Value | Count | Frequency (%) | |
| 𗀠 | 1 | 100.0% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| り | 1 | 25.0% | |
| し | 1 | 25.0% | |
| れ | 1 | 25.0% | |
| さ | 1 | 25.0% |
| Distinct | 24494 |
|---|---|
| Distinct (%) | 57.0% |
| Missing | 3090 |
| Missing (%) | 6.7% |
| Memory size | 360.0 KiB |
| George Tsanis – Workout Solutions Health and Fitness Consultants since 1996 – One-on-one and online distance coaching – Toronto, Canada, World | 1026 |
|---|---|
| Sputnik is a global wire, radio and digital news service. We exist to tell the stories that aren’t being told. | 283 |
| Latest business news and valuable information from China. | 184 |
| Official Twitter account of Ilke News Agency / | 135 |
| | political | cats | bikes | civil rights | tech | photography | 126 |
| Other values (24489) |
| Value | Count | Frequency (%) | |
| George Tsanis – Workout Solutions Health and Fitness Consultants since 1996 – One-on-one and online distance coaching – Toronto, Canada, World | 1026 | 2.2% | |
| Sputnik is a global wire, radio and digital news service. We exist to tell the stories that aren’t being told. | 283 | 0.6% | |
| Latest business news and valuable information from China. | 184 | 0.4% | |
| Official Twitter account of Ilke News Agency / | 135 | 0.3% | |
| | political | cats | bikes | civil rights | tech | photography | 126 | 0.3% | |
| We are a group of traders who are here to impart financial education | 121 | 0.3% | |
| The largest newspaper in China | 110 | 0.2% | |
| Mask-loving, Trump-hating liberal opposed to genetic vaccines🌲🌲 Warp Speed denied U.S. access to traditional vaccines. Until a vaccine there's Ivermectin | 95 | 0.2% | |
| News, views and up-to-date reports from Malaysia's premier news source. All that and more at https://t.co/S8jbx5pMaF | 91 | 0.2% | |
| Brazil SFE®| We are passionate about improving our world with #Artificialintelligence, #Automation, #Analytics #innovation #digital #ai #vr #ar #ml #rpa | 90 | 0.2% | |
| The official twitter account of the Embassy of the People's Republic of China in the Republic of the Philippines | 88 | 0.2% | |
| CGTN is an international media organization. It aims to provide global audiences with accurate and timely news coverage as well as rich audiovisual services. | 79 | 0.2% | |
| The official account of The Peninsula English Daily Newspaper #Qatar #Doha | 73 | 0.2% | |
| I just share my Passion for the Stock Market & my own Conviction. Positive Mindeset🌞 $OCGN $SYA $CBDT $QYOU No financial Advice or Buy Recommendation📈 | 71 | 0.2% | |
| ✍️ INFORMED CONSENT in a socially responsible and just manner. Society has responsibility to ensure means of compensating those with vaccine-related injuries. | 68 | 0.1% | |
| CCTV+ is a leading video news agency in China that offers Chinese news and Chinese perspective on international news. | 68 | 0.1% | |
| Reporting Africa, BRI; Africa fellow in Univ; Years in Africa; Charhar Inst. N.S Korea study;Analyst on China overseas political & economic stakes.Personal view | 63 | 0.1% | |
| Investor with positive & fresh Mindeset🌴☀️ $OCGN $CBDT $CWGYF $QYOU $SYA $IPNFF No financial Advice or Buy Recommendation📈 | 60 | 0.1% | |
| Ex @Tibetans supports Tibetan National Resistance against China’s Military Occupation. When Dalai Lama escaped into Exile, CCP’s dictator Mao said:WE LOST TIBET | 56 | 0.1% | |
| Freedom over censorship, truth over narrative On air in 100+ countries. 10+ billion video views in 2020 Don’t want your news filtered - find us at https://t.co/9V8JaMqU2C | 56 | 0.1% | |
| Human being one of many not for sale | 55 | 0.1% | |
| India's largest independent News Agency | 55 | 0.1% | |
| love the beauty of the nature and I am beautiful too, student of commerce, love to sing,dance and travel at new places | 53 | 0.1% | |
| Professional trading consultant specializing in contracting and due diligence with a strong presence and network in international markets. | 52 | 0.1% | |
| Research Consultant: Political-Economy Analysis,Geopolitics. #Russia, Ex-USSR,M.East. https://t.co/bnDgl9eyg4, https://t.co/KRorvOFoW1, PMPMag Rt#End | 52 | 0.1% | |
| Other values (24469) | 39759 | 86.3% | |
| (Missing) | 3090 | 6.7% |
Frequencies of value counts
Unique
| Unique | 19501 ? |
|---|---|
| Unique (%) | 45.4% |
Histogram of lengths of the category
Length
| Max length | 248 |
|---|---|
| Median length | 115 |
| Mean length | 101.1463775 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 632325 | 13.6% | ||
| e | 366512 | 7.9% | |
| a | 275198 | 5.9% | |
| i | 266213 | 5.7% | |
| n | 264026 | 5.7% | |
| o | 261927 | 5.6% | |
| t | 259416 | 5.6% | |
| r | 223533 | 4.8% | |
| s | 211119 | 4.5% | |
| l | 147031 | 3.2% | |
| c | 111552 | 2.4% | |
| d | 110103 | 2.4% | |
| h | 99119 | 2.1% | |
| u | 89833 | 1.9% | |
| m | 78464 | 1.7% | |
| g | 65123 | 1.4% | |
| p | 64489 | 1.4% | |
| . | 64285 | 1.4% | |
| f | 58055 | 1.2% | |
| y | 53920 | 1.2% | |
| , | 51325 | 1.1% | |
| w | 48144 | 1.0% | |
| v | 42251 | 0.9% | |
| b | 37889 | 0.8% | |
| C | 31995 | 0.7% | |
| Other values (3516) | 744854 | 16.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3193530 | 68.5% | |
| Space Separator | 632501 | 13.6% | |
| Uppercase Letter | 418380 | 9.0% | |
| Other Punctuation | 223414 | 4.8% | |
| Decimal Number | 39240 | 0.8% | |
| Other Symbol | 32666 | 0.7% | |
| Other Letter | 29724 | 0.6% | |
| Control | 18412 | 0.4% | |
| Dash Punctuation | 17927 | 0.4% | |
| Math Symbol | 17078 | 0.4% | |
| Nonspacing Mark | 10566 | 0.2% | |
| Spacing Mark | 5580 | 0.1% | |
| Close Punctuation | 3508 | 0.1% | |
| Open Punctuation | 3372 | 0.1% | |
| Final Punctuation | 3230 | 0.1% | |
| Format | 2720 | 0.1% | |
| Connector Punctuation | 2532 | 0.1% | |
| Currency Symbol | 1762 | < 0.1% | |
| Modifier Symbol | 1287 | < 0.1% | |
| Initial Punctuation | 692 | < 0.1% | |
| Modifier Letter | 485 | < 0.1% | |
| Enclosing Mark | 52 | < 0.1% | |
| Private Use | 27 | < 0.1% | |
| Other Number | 16 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 31995 | 7.6% | |
| S | 30556 | 7.3% | |
| A | 29590 | 7.1% | |
| T | 29080 | 7.0% | |
| I | 25973 | 6.2% | |
| P | 23744 | 5.7% | |
| M | 23473 | 5.6% | |
| E | 21476 | 5.1% | |
| N | 19355 | 4.6% | |
| R | 18525 | 4.4% | |
| D | 17983 | 4.3% | |
| B | 17775 | 4.2% | |
| F | 17155 | 4.1% | |
| L | 15693 | 3.8% | |
| O | 15042 | 3.6% | |
| H | 14841 | 3.5% | |
| W | 13998 | 3.3% | |
| G | 12878 | 3.1% | |
| U | 9473 | 2.3% | |
| V | 8035 | 1.9% | |
| J | 5899 | 1.4% | |
| K | 4977 | 1.2% | |
| Y | 4763 | 1.1% | |
| Q | 1929 | 0.5% | |
| X | 1519 | 0.4% | |
| Other values (225) | 2653 | 0.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 366512 | 11.5% | |
| a | 275198 | 8.6% | |
| i | 266213 | 8.3% | |
| n | 264026 | 8.3% | |
| o | 261927 | 8.2% | |
| t | 259416 | 8.1% | |
| r | 223533 | 7.0% | |
| s | 211119 | 6.6% | |
| l | 147031 | 4.6% | |
| c | 111552 | 3.5% | |
| d | 110103 | 3.4% | |
| h | 99119 | 3.1% | |
| u | 89833 | 2.8% | |
| m | 78464 | 2.5% | |
| g | 65123 | 2.0% | |
| p | 64489 | 2.0% | |
| f | 58055 | 1.8% | |
| y | 53920 | 1.7% | |
| w | 48144 | 1.5% | |
| v | 42251 | 1.3% | |
| b | 37889 | 1.2% | |
| k | 27676 | 0.9% | |
| x | 7132 | 0.2% | |
| j | 5207 | 0.2% | |
| z | 5049 | 0.2% | |
| Other values (464) | 14549 | 0.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 632325 | > 99.9% | ||
| 171 | < 0.1% | ||
| 4 | < 0.1% | ||
| 1 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 64285 | 28.8% | |
| , | 51325 | 23.0% | |
| # | 31168 | 14.0% | |
| / | 22932 | 10.3% | |
| @ | 13062 | 5.8% | |
| & | 10046 | 4.5% | |
| : | 9711 | 4.3% | |
| ! | 6217 | 2.8% | |
| ' | 6004 | 2.7% | |
| ; | 2527 | 1.1% | |
| • | 2084 | 0.9% | |
| " | 961 | 0.4% | |
| * | 861 | 0.4% | |
| ? | 575 | 0.3% | |
| % | 419 | 0.2% | |
| । | 378 | 0.2% | |
| … | 193 | 0.1% | |
| \ | 109 | < 0.1% | |
| 。 | 92 | < 0.1% | |
| 、 | 87 | < 0.1% | |
| ، | 55 | < 0.1% | |
| ! | 49 | < 0.1% | |
| † | 45 | < 0.1% | |
| , | 33 | < 0.1% | |
| ॥ | 33 | < 0.1% | |
| Other values (22) | 163 | 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 7481 | 19.1% | |
| 2 | 5725 | 14.6% | |
| 9 | 5624 | 14.3% | |
| 0 | 5565 | 14.2% | |
| 6 | 3083 | 7.9% | |
| 4 | 2820 | 7.2% | |
| 3 | 2566 | 6.5% | |
| 5 | 2292 | 5.8% | |
| 8 | 2239 | 5.7% | |
| 7 | 1800 | 4.6% | |
| 𝟏 | 11 | < 0.1% | |
| 𝟗 | 5 | < 0.1% | |
| 𝟖 | 5 | < 0.1% | |
| ३ | 4 | < 0.1% | |
| २ | 2 | < 0.1% | |
| 𝟸 | 2 | < 0.1% | |
| 𝟺 | 2 | < 0.1% | |
| ۹ | 2 | < 0.1% | |
| 𝟚 | 2 | < 0.1% | |
| ౦ | 2 | < 0.1% | |
| 𝟐 | 1 | < 0.1% | |
| 𝟔 | 1 | < 0.1% | |
| ౨ | 1 | < 0.1% | |
| 𝟜 | 1 | < 0.1% | |
| 𝟘 | 1 | < 0.1% | |
| Other values (3) | 3 | < 0.1% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| 🇺 | 1144 | 3.5% | |
| 🇮 | 940 | 2.9% | |
| 🇳 | 865 | 2.6% | |
| ❤ | 840 | 2.6% | |
| 🇸 | 817 | 2.5% | |
| 🇪 | 663 | 2.0% | |
| 🌈 | 596 | 1.8% | |
| 🇦 | 533 | 1.6% | |
| 🇧 | 489 | 1.5% | |
| 🇨 | 469 | 1.4% | |
| 🇬 | 447 | 1.4% | |
| 🇷 | 428 | 1.3% | |
| 🏳 | 401 | 1.2% | |
| 🙏 | 379 | 1.2% | |
| 🚩 | 371 | 1.1% | |
| 💙 | 343 | 1.1% | |
| 🇵 | 325 | 1.0% | |
| 👩 | 301 | 0.9% | |
| 🇰 | 291 | 0.9% | |
| 📈 | 287 | 0.9% | |
| 🌲 | 279 | 0.9% | |
| 🇹 | 255 | 0.8% | |
| ♀ | 254 | 0.8% | |
| 🇭 | 251 | 0.8% | |
| 🇲 | 246 | 0.8% | |
| Other values (1139) | 20452 | 62.6% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| | 1353 | 49.7% | |
| | 226 | 8.3% | |
| | 205 | 7.5% | |
| | 170 | 6.2% | |
| | 168 | 6.2% | |
| | 113 | 4.2% | |
| | 71 | 2.6% | |
| | 70 | 2.6% | |
| | 65 | 2.4% | |
| | 56 | 2.1% | |
| | 56 | 2.1% | |
| | 52 | 1.9% | |
| | 43 | 1.6% | |
| | 42 | 1.5% | |
| | 18 | 0.7% | |
| | 6 | 0.2% | |
| | 6 | 0.2% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 14524 | 81.0% | |
| – | 3169 | 17.7% | |
| — | 199 | 1.1% | |
| ― | 19 | 0.1% | |
| ‑ | 10 | 0.1% | |
| 〰 | 5 | < 0.1% | |
| ‐ | 1 | < 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| | | 14158 | 82.9% | |
| + | 1185 | 6.9% | |
| ~ | 594 | 3.5% | |
| = | 564 | 3.3% | |
| ≠ | 447 | 2.6% | |
| ∆ | 14 | 0.1% | |
| ↓ | 13 | 0.1% | |
| ⤵ | 13 | 0.1% | |
| ∂ | 13 | 0.1% | |
| ◾ | 11 | 0.1% | |
| → | 7 | < 0.1% | |
| ≫ | 6 | < 0.1% | |
| × | 6 | < 0.1% | |
| ∙ | 6 | < 0.1% | |
| ∞ | 5 | < 0.1% | |
| | | 4 | < 0.1% | |
| ÷ | 3 | < 0.1% | |
| ⇔ | 3 | < 0.1% | |
| ~ | 3 | < 0.1% | |
| ≈ | 3 | < 0.1% | |
| ← | 3 | < 0.1% | |
| ↑ | 2 | < 0.1% | |
| ⋆ | 2 | < 0.1% | |
| ∣ | 2 | < 0.1% | |
| ⦁ | 2 | < 0.1% | |
| Other values (8) | 9 | 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 2528 | 99.8% | |
| ‿ | 4 | 0.2% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 18224 | 99.0% | ||
| 186 | 1.0% | ||
| 2 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 3212 | 95.3% | |
| [ | 47 | 1.4% | |
| { | 37 | 1.1% | |
| ( | 30 | 0.9% | |
| 「 | 18 | 0.5% | |
| 《 | 12 | 0.4% | |
| [ | 6 | 0.2% | |
| „ | 5 | 0.1% | |
| ⟬ | 4 | 0.1% | |
| ﴿ | 1 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 3348 | 95.4% | |
| ] | 49 | 1.4% | |
| } | 40 | 1.1% | |
| ) | 30 | 0.9% | |
| 」 | 18 | 0.5% | |
| 》 | 12 | 0.3% | |
| ] | 6 | 0.2% | |
| ⟭ | 4 | 0.1% | |
| ﴾ | 1 | < 0.1% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 2757 | 85.4% | |
| ” | 457 | 14.1% | |
| » | 16 | 0.5% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| 🏻 | 471 | 36.6% | |
| 🏼 | 288 | 22.4% | |
| 🏾 | 196 | 15.2% | |
| 🏽 | 193 | 15.0% | |
| 🏿 | 57 | 4.4% | |
| ^ | 44 | 3.4% | |
| ` | 16 | 1.2% | |
| ¯ | 9 | 0.7% | |
| ¸ | 5 | 0.4% | |
| ´ | 4 | 0.3% | |
| ¨ | 4 | 0.3% |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| ️ | 3888 | 36.8% | |
| ् | 1886 | 17.8% | |
| े | 1078 | 10.2% | |
| ं | 709 | 6.7% | |
| ͟ | 481 | 4.6% | |
| ु | 464 | 4.4% | |
| ் | 285 | 2.7% | |
| ै | 250 | 2.4% | |
| ू | 241 | 2.3% | |
| ್ | 96 | 0.9% | |
| ಿ | 90 | 0.9% | |
| َ | 74 | 0.7% | |
| ँ | 71 | 0.7% | |
| ृ | 64 | 0.6% | |
| ़ | 62 | 0.6% | |
| ్ | 52 | 0.5% | |
| ︎ | 51 | 0.5% | |
| ِ | 45 | 0.4% | |
| ّ | 39 | 0.4% | |
| ೆ | 37 | 0.4% | |
| ా | 37 | 0.4% | |
| ి | 36 | 0.3% | |
| ُ | 33 | 0.3% | |
| ্ | 25 | 0.2% | |
| ْ | 24 | 0.2% | |
| Other values (94) | 448 | 4.2% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| र | 1902 | 6.4% | |
| क | 1211 | 4.1% | |
| म | 1186 | 4.0% | |
| त | 1173 | 3.9% | |
| ا | 995 | 3.3% | |
| स | 947 | 3.2% | |
| ह | 903 | 3.0% | |
| न | 858 | 2.9% | |
| व | 698 | 2.3% | |
| य | 682 | 2.3% | |
| ل | 659 | 2.2% | |
| द | 624 | 2.1% | |
| ज | 478 | 1.6% | |
| ल | 436 | 1.5% | |
| ي | 426 | 1.4% | |
| प | 426 | 1.4% | |
| م | 402 | 1.4% | |
| ن | 381 | 1.3% | |
| ر | 376 | 1.3% | |
| و | 365 | 1.2% | |
| भ | 354 | 1.2% | |
| ष | 341 | 1.1% | |
| ग | 326 | 1.1% | |
| श | 311 | 1.0% | |
| ब | 269 | 0.9% | |
| Other values (1189) | 12995 | 43.7% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 1718 | 97.5% | |
| € | 21 | 1.2% | |
| £ | 12 | 0.7% | |
| ¢ | 5 | 0.3% | |
| ₿ | 2 | 0.1% | |
| ¤ | 2 | 0.1% | |
| ₩ | 1 | 0.1% | |
| ¥ | 1 | 0.1% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ा | 2061 | 36.9% | |
| ि | 1108 | 19.9% | |
| ी | 741 | 13.3% | |
| ो | 579 | 10.4% | |
| ி | 150 | 2.7% | |
| ः | 119 | 2.1% | |
| া | 79 | 1.4% | |
| ு | 69 | 1.2% | |
| ಾ | 68 | 1.2% | |
| ா | 65 | 1.2% | |
| ು | 53 | 0.9% | |
| ు | 47 | 0.8% | |
| ি | 39 | 0.7% | |
| ौ | 31 | 0.6% | |
| ே | 27 | 0.5% | |
| ಂ | 27 | 0.5% | |
| ை | 24 | 0.4% | |
| ೇ | 23 | 0.4% | |
| ॉ | 22 | 0.4% | |
| ਾ | 20 | 0.4% | |
| ం | 19 | 0.3% | |
| ে | 18 | 0.3% | |
| ோ | 17 | 0.3% | |
| ੀ | 16 | 0.3% | |
| ೂ | 14 | 0.3% | |
| Other values (35) | 144 | 2.6% |
Most frequent Initial Punctuation characters
| Value | Count | Frequency (%) | |
| “ | 449 | 64.9% | |
| ‘ | 227 | 32.8% | |
| « | 16 | 2.3% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ² | 10 | 62.5% | |
| ❾ | 2 | 12.5% | |
| ¾ | 2 | 12.5% | |
| ⅜ | 1 | 6.2% | |
| ⁴ | 1 | 6.2% |
Most frequent Modifier Letter characters
| Value | Count | Frequency (%) | |
| ー | 86 | 17.7% | |
| ᵃ | 38 | 7.8% | |
| ᵉ | 34 | 7.0% | |
| ـ | 32 | 6.6% | |
| ᵗ | 26 | 5.4% | |
| ⁱ | 26 | 5.4% | |
| ⁿ | 20 | 4.1% | |
| ʳ | 19 | 3.9% | |
| ˢ | 19 | 3.9% | |
| ᵈ | 18 | 3.7% | |
| ᵘ | 17 | 3.5% | |
| ᵒ | 16 | 3.3% | |
| ʰ | 14 | 2.9% | |
| ˡ | 12 | 2.5% | |
| ᶜ | 12 | 2.5% | |
| ᵏ | 10 | 2.1% | |
| ᶠ | 9 | 1.9% | |
| ᵐ | 7 | 1.4% | |
| ᵛ | 6 | 1.2% | |
| ᵖ | 5 | 1.0% | |
| ๆ | 4 | 0.8% | |
| ʲ | 4 | 0.8% | |
| ᵇ | 4 | 0.8% | |
| ₖ | 4 | 0.8% | |
| ₑ | 4 | 0.8% | |
| Other values (24) | 39 | 8.0% |
Most frequent Enclosing Mark characters
| Value | Count | Frequency (%) | |
| ⃣ | 28 | 53.8% | |
| ҉ | 24 | 46.2% |
Most frequent Private Use characters
| Value | Count | Frequency (%) | |
| | 19 | 70.4% | |
| | 3 | 11.1% | |
| | 3 | 11.1% | |
| | 2 | 7.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3602533 | 77.3% | |
| Common | 1001864 | 21.5% | |
| Devanagari | 24516 | 0.5% | |
| Arabic | 6577 | 0.1% | |
| Inherited | 6113 | 0.1% | |
| Cyrillic | 5747 | 0.1% | |
| Han | 3235 | 0.1% | |
| Tamil | 1696 | < 0.1% | |
| Kannada | 1177 | < 0.1% | |
| Greek | 1145 | < 0.1% | |
| Hiragana | 732 | < 0.1% | |
| Telugu | 596 | < 0.1% | |
| Bengali | 546 | < 0.1% | |
| Thai | 545 | < 0.1% | |
| Katakana | 458 | < 0.1% | |
| Hebrew | 198 | < 0.1% | |
| Gurmukhi | 195 | < 0.1% | |
| Canadian_Aboriginal | 143 | < 0.1% | |
| Malayalam | 116 | < 0.1% | |
| Oriya | 109 | < 0.1% | |
| Sinhala | 102 | < 0.1% | |
| Gujarati | 97 | < 0.1% | |
| Hangul | 87 | < 0.1% | |
| Braille | 40 | < 0.1% | |
| Armenian | 34 | < 0.1% | |
| Other values (17) | 100 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 366512 | 10.2% | |
| a | 275198 | 7.6% | |
| i | 266213 | 7.4% | |
| n | 264026 | 7.3% | |
| o | 261927 | 7.3% | |
| t | 259416 | 7.2% | |
| r | 223533 | 6.2% | |
| s | 211119 | 5.9% | |
| l | 147031 | 4.1% | |
| c | 111552 | 3.1% | |
| d | 110103 | 3.1% | |
| h | 99119 | 2.8% | |
| u | 89833 | 2.5% | |
| m | 78464 | 2.2% | |
| g | 65123 | 1.8% | |
| p | 64489 | 1.8% | |
| f | 58055 | 1.6% | |
| y | 53920 | 1.5% | |
| w | 48144 | 1.3% | |
| v | 42251 | 1.2% | |
| b | 37889 | 1.1% | |
| C | 31995 | 0.9% | |
| S | 30556 | 0.8% | |
| A | 29590 | 0.8% | |
| T | 29080 | 0.8% | |
| Other values (224) | 347395 | 9.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 632325 | 63.1% | ||
| . | 64285 | 6.4% | |
| , | 51325 | 5.1% | |
| # | 31168 | 3.1% | |
| / | 22932 | 2.3% | |
| 18224 | 1.8% | ||
| - | 14524 | 1.4% | |
| | | 14158 | 1.4% | |
| @ | 13062 | 1.3% | |
| & | 10046 | 1.0% | |
| : | 9711 | 1.0% | |
| 1 | 7481 | 0.7% | |
| ! | 6217 | 0.6% | |
| ' | 6004 | 0.6% | |
| 2 | 5725 | 0.6% | |
| 9 | 5624 | 0.6% | |
| 0 | 5565 | 0.6% | |
| ) | 3348 | 0.3% | |
| ( | 3212 | 0.3% | |
| – | 3169 | 0.3% | |
| 6 | 3083 | 0.3% | |
| 4 | 2820 | 0.3% | |
| ’ | 2757 | 0.3% | |
| 3 | 2566 | 0.3% | |
| _ | 2528 | 0.3% | |
| Other values (1689) | 60005 | 6.0% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ️ | 3888 | 63.6% | |
| | 1353 | 22.1% | |
| ͟ | 481 | 7.9% | |
| َ | 74 | 1.2% | |
| ︎ | 51 | 0.8% | |
| ِ | 45 | 0.7% | |
| ّ | 39 | 0.6% | |
| ُ | 33 | 0.5% | |
| ⃣ | 28 | 0.5% | |
| ْ | 24 | 0.4% | |
| ̶ | 13 | 0.2% | |
| ̞ | 10 | 0.2% | |
| ً | 9 | 0.1% | |
| ̈ | 8 | 0.1% | |
| | 6 | 0.1% | |
| ٰ | 6 | 0.1% | |
| ͌ | 4 | 0.1% | |
| ̽ | 4 | 0.1% | |
| ٍ | 2 | < 0.1% | |
| ́ | 2 | < 0.1% | |
| ͛ | 2 | < 0.1% | |
| ͖ | 2 | < 0.1% | |
| ̆ | 2 | < 0.1% | |
| ̂ | 2 | < 0.1% | |
| ͚ | 2 | < 0.1% | |
| Other values (17) | 23 | 0.4% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 995 | 15.1% | |
| ل | 659 | 10.0% | |
| ي | 426 | 6.5% | |
| م | 402 | 6.1% | |
| ن | 381 | 5.8% | |
| ر | 376 | 5.7% | |
| و | 365 | 5.5% | |
| ت | 256 | 3.9% | |
| ب | 223 | 3.4% | |
| ع | 217 | 3.3% | |
| س | 214 | 3.3% | |
| د | 214 | 3.3% | |
| ة | 191 | 2.9% | |
| ف | 127 | 1.9% | |
| ح | 126 | 1.9% | |
| ه | 124 | 1.9% | |
| ق | 111 | 1.7% | |
| ك | 108 | 1.6% | |
| ی | 106 | 1.6% | |
| ش | 81 | 1.2% | |
| خ | 80 | 1.2% | |
| ج | 71 | 1.1% | |
| أ | 71 | 1.1% | |
| ط | 69 | 1.0% | |
| ز | 66 | 1.0% | |
| Other values (30) | 518 | 7.9% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 2061 | 8.4% | |
| र | 1902 | 7.8% | |
| ् | 1886 | 7.7% | |
| क | 1211 | 4.9% | |
| म | 1186 | 4.8% | |
| त | 1173 | 4.8% | |
| ि | 1108 | 4.5% | |
| े | 1078 | 4.4% | |
| स | 947 | 3.9% | |
| ह | 903 | 3.7% | |
| न | 858 | 3.5% | |
| ी | 741 | 3.0% | |
| ं | 709 | 2.9% | |
| व | 698 | 2.8% | |
| य | 682 | 2.8% | |
| द | 624 | 2.5% | |
| ो | 579 | 2.4% | |
| ज | 478 | 1.9% | |
| ु | 464 | 1.9% | |
| ल | 436 | 1.8% | |
| प | 426 | 1.7% | |
| भ | 354 | 1.4% | |
| ष | 341 | 1.4% | |
| ग | 326 | 1.3% | |
| श | 311 | 1.3% | |
| Other values (47) | 3034 | 12.4% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| и | 532 | 9.3% | |
| с | 473 | 8.2% | |
| о | 443 | 7.7% | |
| а | 422 | 7.3% | |
| е | 333 | 5.8% | |
| н | 314 | 5.5% | |
| р | 252 | 4.4% | |
| т | 249 | 4.3% | |
| к | 242 | 4.2% | |
| л | 220 | 3.8% | |
| в | 212 | 3.7% | |
| у | 153 | 2.7% | |
| д | 152 | 2.6% | |
| м | 126 | 2.2% | |
| ь | 120 | 2.1% | |
| я | 111 | 1.9% | |
| п | 97 | 1.7% | |
| з | 96 | 1.7% | |
| й | 95 | 1.7% | |
| П | 89 | 1.5% | |
| ы | 87 | 1.5% | |
| х | 82 | 1.4% | |
| Р | 72 | 1.3% | |
| ю | 69 | 1.2% | |
| ц | 61 | 1.1% | |
| Other values (42) | 645 | 11.2% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| イ | 53 | 11.6% | |
| ン | 33 | 7.2% | |
| ド | 29 | 6.3% | |
| ス | 28 | 6.1% | |
| ツ | 28 | 6.1% | |
| ッ | 20 | 4.4% | |
| バ | 18 | 3.9% | |
| フ | 15 | 3.3% | |
| コ | 14 | 3.1% | |
| タ | 14 | 3.1% | |
| ォ | 14 | 3.1% | |
| ロ | 13 | 2.8% | |
| ビ | 13 | 2.8% | |
| ア | 12 | 2.6% | |
| オ | 12 | 2.6% | |
| サ | 11 | 2.4% | |
| ト | 11 | 2.4% | |
| ク | 8 | 1.7% | |
| シ | 7 | 1.5% | |
| カ | 7 | 1.5% | |
| ヌ | 7 | 1.5% | |
| メ | 6 | 1.3% | |
| ャ | 6 | 1.3% | |
| ウ | 6 | 1.3% | |
| ル | 5 | 1.1% | |
| Other values (28) | 68 | 14.8% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 研 | 60 | 1.9% | |
| 人 | 58 | 1.8% | |
| 新 | 57 | 1.8% | |
| 学 | 56 | 1.7% | |
| 国 | 53 | 1.6% | |
| 究 | 49 | 1.5% | |
| 大 | 44 | 1.4% | |
| 中 | 42 | 1.3% | |
| 港 | 39 | 1.2% | |
| 師 | 34 | 1.1% | |
| 実 | 33 | 1.0% | |
| 京 | 31 | 1.0% | |
| 日 | 30 | 0.9% | |
| 立 | 29 | 0.9% | |
| 我 | 29 | 0.9% | |
| 香 | 29 | 0.9% | |
| 命 | 27 | 0.8% | |
| 本 | 27 | 0.8% | |
| 語 | 26 | 0.8% | |
| 革 | 25 | 0.8% | |
| 民 | 25 | 0.8% | |
| 和 | 25 | 0.8% | |
| 报 | 24 | 0.7% | |
| 共 | 24 | 0.7% | |
| 主 | 24 | 0.7% | |
| Other values (547) | 2335 | 72.2% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| の | 56 | 7.7% | |
| て | 46 | 6.3% | |
| で | 45 | 6.1% | |
| は | 44 | 6.0% | |
| し | 40 | 5.5% | |
| い | 40 | 5.5% | |
| た | 39 | 5.3% | |
| を | 35 | 4.8% | |
| り | 34 | 4.6% | |
| っ | 32 | 4.4% | |
| な | 29 | 4.0% | |
| と | 22 | 3.0% | |
| る | 22 | 3.0% | |
| す | 20 | 2.7% | |
| に | 19 | 2.6% | |
| せ | 17 | 2.3% | |
| き | 16 | 2.2% | |
| あ | 15 | 2.0% | |
| ら | 14 | 1.9% | |
| ま | 14 | 1.9% | |
| ね | 11 | 1.5% | |
| が | 11 | 1.5% | |
| く | 11 | 1.5% | |
| れ | 8 | 1.1% | |
| も | 8 | 1.1% | |
| Other values (26) | 84 | 11.5% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 리 | 3 | 3.4% | |
| 뉴 | 3 | 3.4% | |
| 스 | 3 | 3.4% | |
| 신 | 2 | 2.3% | |
| 의 | 2 | 2.3% | |
| 어 | 2 | 2.3% | |
| 트 | 2 | 2.3% | |
| 기 | 2 | 2.3% | |
| 아 | 2 | 2.3% | |
| 이 | 2 | 2.3% | |
| ㅣ | 2 | 2.3% | |
| 평 | 1 | 1.1% | |
| 일 | 1 | 1.1% | |
| 오 | 1 | 1.1% | |
| 전 | 1 | 1.1% | |
| 헨 | 1 | 1.1% | |
| 영 | 1 | 1.1% | |
| 시 | 1 | 1.1% | |
| 사 | 1 | 1.1% | |
| 팩 | 1 | 1.1% | |
| 체 | 1 | 1.1% | |
| 크 | 1 | 1.1% | |
| 한 | 1 | 1.1% | |
| 국 | 1 | 1.1% | |
| 담 | 1 | 1.1% | |
| Other values (48) | 48 | 55.2% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| ε | 89 | 7.8% | |
| π | 76 | 6.6% | |
| α | 74 | 6.5% | |
| ο | 70 | 6.1% | |
| ι | 69 | 6.0% | |
| ς | 68 | 5.9% | |
| σ | 66 | 5.8% | |
| ν | 53 | 4.6% | |
| ρ | 48 | 4.2% | |
| υ | 38 | 3.3% | |
| ω | 38 | 3.3% | |
| κ | 31 | 2.7% | |
| η | 29 | 2.5% | |
| θ | 29 | 2.5% | |
| μ | 24 | 2.1% | |
| τ | 22 | 1.9% | |
| Φ | 21 | 1.8% | |
| Δ | 20 | 1.7% | |
| λ | 17 | 1.5% | |
| Θ | 16 | 1.4% | |
| Σ | 15 | 1.3% | |
| ό | 15 | 1.3% | |
| έ | 15 | 1.3% | |
| γ | 14 | 1.2% | |
| Ω | 13 | 1.1% | |
| Other values (42) | 175 | 15.3% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 285 | 16.8% | |
| ம | 164 | 9.7% | |
| ி | 150 | 8.8% | |
| த | 141 | 8.3% | |
| க | 97 | 5.7% | |
| ல | 92 | 5.4% | |
| ன | 76 | 4.5% | |
| ழ | 73 | 4.3% | |
| ு | 69 | 4.1% | |
| ா | 65 | 3.8% | |
| ப | 48 | 2.8% | |
| ந | 45 | 2.7% | |
| ர | 44 | 2.6% | |
| வ | 44 | 2.6% | |
| ய | 36 | 2.1% | |
| ட | 33 | 1.9% | |
| ச | 33 | 1.9% | |
| இ | 29 | 1.7% | |
| ே | 27 | 1.6% | |
| ை | 24 | 1.4% | |
| ோ | 17 | 1.0% | |
| ற | 14 | 0.8% | |
| ண | 14 | 0.8% | |
| ெ | 12 | 0.7% | |
| உ | 11 | 0.6% | |
| Other values (14) | 53 | 3.1% |
Most frequent Unknown characters
| Value | Count | Frequency (%) | |
| | 19 | 70.4% | |
| | 3 | 11.1% | |
| | 3 | 11.1% | |
| | 2 | 7.4% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 40 | 100.0% |
Most frequent Egyptian_Hieroglyphs characters
| Value | Count | Frequency (%) | |
| 𓅓 | 1 | 33.3% | |
| 𓆉 | 1 | 33.3% | |
| 𓇽 | 1 | 33.3% |
Most frequent Old_Turkic characters
| Value | Count | Frequency (%) | |
| 𐰀 | 3 | 15.8% | |
| 𐰤 | 2 | 10.5% | |
| 𐰢 | 2 | 10.5% | |
| 𐰆 | 2 | 10.5% | |
| 𐰇 | 2 | 10.5% | |
| 𐱃 | 1 | 5.3% | |
| 𐰞 | 1 | 5.3% | |
| 𐱅 | 1 | 5.3% | |
| 𐰼 | 1 | 5.3% | |
| 𐰚 | 1 | 5.3% | |
| 𐰓 | 1 | 5.3% | |
| 𐰃 | 1 | 5.3% | |
| 𐰘 | 1 | 5.3% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ି | 10 | 9.2% | |
| ା | 8 | 7.3% | |
| ଆ | 7 | 6.4% | |
| ୍ | 7 | 6.4% | |
| ପ | 6 | 5.5% | |
| ର | 6 | 5.5% | |
| ଲ | 4 | 3.7% | |
| ଓ | 4 | 3.7% | |
| ଡ | 4 | 3.7% | |
| ଼ | 4 | 3.7% | |
| ନ | 4 | 3.7% | |
| ସ | 3 | 2.8% | |
| ଖ | 3 | 2.8% | |
| ୀ | 3 | 2.8% | |
| ଣ | 3 | 2.8% | |
| ମ | 3 | 2.8% | |
| ବ | 3 | 2.8% | |
| ତ | 2 | 1.8% | |
| େ | 2 | 1.8% | |
| କ | 2 | 1.8% | |
| ଶ | 2 | 1.8% | |
| ୱ | 2 | 1.8% | |
| ୁ | 2 | 1.8% | |
| ଦ | 2 | 1.8% | |
| ଷ | 2 | 1.8% | |
| Other values (11) | 11 | 10.1% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| া | 79 | 14.5% | |
| ি | 39 | 7.1% | |
| র | 37 | 6.8% | |
| ব | 34 | 6.2% | |
| ্ | 25 | 4.6% | |
| ন | 25 | 4.6% | |
| ত | 22 | 4.0% | |
| স | 19 | 3.5% | |
| দ | 18 | 3.3% | |
| ে | 18 | 3.3% | |
| ক | 18 | 3.3% | |
| প | 14 | 2.6% | |
| হ | 14 | 2.6% | |
| ল | 14 | 2.6% | |
| জ | 13 | 2.4% | |
| ু | 12 | 2.2% | |
| ী | 11 | 2.0% | |
| ই | 11 | 2.0% | |
| অ | 10 | 1.8% | |
| য় | 9 | 1.6% | |
| ভ | 9 | 1.6% | |
| ঙ | 8 | 1.5% | |
| ম | 8 | 1.5% | |
| উ | 7 | 1.3% | |
| য | 7 | 1.3% | |
| Other values (20) | 65 | 11.9% |
Most frequent Cuneiform characters
| Value | Count | Frequency (%) | |
| 𒀭 | 2 | 100.0% |
Most frequent Sinhala characters
| Value | Count | Frequency (%) | |
| ි | 12 | 11.8% | |
| ව | 8 | 7.8% | |
| ් | 7 | 6.9% | |
| ු | 6 | 5.9% | |
| ෙ | 6 | 5.9% | |
| ය | 6 | 5.9% | |
| ල | 5 | 4.9% | |
| න | 5 | 4.9% | |
| ප | 4 | 3.9% | |
| ක | 4 | 3.9% | |
| ම | 4 | 3.9% | |
| ා | 3 | 2.9% | |
| ද | 3 | 2.9% | |
| හ | 3 | 2.9% | |
| ඩ | 3 | 2.9% | |
| ස | 2 | 2.0% | |
| බ | 2 | 2.0% | |
| ර | 2 | 2.0% | |
| ඒ | 2 | 2.0% | |
| උ | 2 | 2.0% | |
| ඟ | 2 | 2.0% | |
| ො | 1 | 1.0% | |
| ට | 1 | 1.0% | |
| ත | 1 | 1.0% | |
| එ | 1 | 1.0% | |
| Other values (7) | 7 | 6.9% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| า | 39 | 7.2% | |
| ม | 37 | 6.8% | |
| ร | 32 | 5.9% | |
| น | 28 | 5.1% | |
| ่ | 24 | 4.4% | |
| อ | 21 | 3.9% | |
| ก | 20 | 3.7% | |
| ั | 18 | 3.3% | |
| ค | 18 | 3.3% | |
| เ | 18 | 3.3% | |
| ย | 17 | 3.1% | |
| ไ | 16 | 2.9% | |
| ิ | 14 | 2.6% | |
| ้ | 14 | 2.6% | |
| ี | 12 | 2.2% | |
| ต | 12 | 2.2% | |
| ง | 12 | 2.2% | |
| ์ | 11 | 2.0% | |
| ว | 11 | 2.0% | |
| ท | 11 | 2.0% | |
| ็ | 11 | 2.0% | |
| บ | 11 | 2.0% | |
| ภ | 10 | 1.8% | |
| ป | 9 | 1.7% | |
| ล | 9 | 1.7% | |
| Other values (28) | 110 | 20.2% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ನ | 107 | 9.1% | |
| ್ | 96 | 8.2% | |
| ಿ | 90 | 7.6% | |
| ಕ | 80 | 6.8% | |
| ರ | 80 | 6.8% | |
| ಾ | 68 | 5.8% | |
| ು | 53 | 4.5% | |
| ಮ | 50 | 4.2% | |
| ಯ | 48 | 4.1% | |
| ಗ | 44 | 3.7% | |
| ದ | 43 | 3.7% | |
| ತ | 41 | 3.5% | |
| ೆ | 37 | 3.1% | |
| ಡ | 28 | 2.4% | |
| ಂ | 27 | 2.3% | |
| ಜ | 27 | 2.3% | |
| ಳ | 25 | 2.1% | |
| ೇ | 23 | 2.0% | |
| ಹ | 21 | 1.8% | |
| ಪ | 19 | 1.6% | |
| ವ | 19 | 1.6% | |
| ಟ | 17 | 1.4% | |
| ೂ | 14 | 1.2% | |
| ಅ | 14 | 1.2% | |
| ಚ | 13 | 1.1% | |
| Other values (18) | 93 | 7.9% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| א | 22 | 11.1% | |
| ו | 18 | 9.1% | |
| י | 16 | 8.1% | |
| ל | 16 | 8.1% | |
| ר | 15 | 7.6% | |
| ש | 13 | 6.6% | |
| ת | 9 | 4.5% | |
| ה | 8 | 4.0% | |
| ְ | 8 | 4.0% | |
| ָ | 8 | 4.0% | |
| ע | 6 | 3.0% | |
| מ | 6 | 3.0% | |
| ב | 6 | 3.0% | |
| ד | 5 | 2.5% | |
| ג | 5 | 2.5% | |
| ֶ | 5 | 2.5% | |
| ך | 4 | 2.0% | |
| כ | 3 | 1.5% | |
| ם | 3 | 1.5% | |
| ּ | 3 | 1.5% | |
| ַ | 3 | 1.5% | |
| נ | 2 | 1.0% | |
| ח | 2 | 1.0% | |
| צ | 2 | 1.0% | |
| ף | 2 | 1.0% | |
| Other values (6) | 8 | 4.0% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ਾ | 20 | 10.3% | |
| ੀ | 16 | 8.2% | |
| ਸ | 14 | 7.2% | |
| ਂ | 9 | 4.6% | |
| ਕ | 9 | 4.6% | |
| ਰ | 8 | 4.1% | |
| ਜ | 7 | 3.6% | |
| ਬ | 7 | 3.6% | |
| ਤ | 7 | 3.6% | |
| ਦ | 7 | 3.6% | |
| ਵ | 7 | 3.6% | |
| ੁ | 6 | 3.1% | |
| ਨ | 6 | 3.1% | |
| ਹ | 6 | 3.1% | |
| ਗ | 5 | 2.6% | |
| ਿ | 5 | 2.6% | |
| ਼ | 5 | 2.6% | |
| ਲ | 4 | 2.1% | |
| ਪ | 4 | 2.1% | |
| ੰ | 4 | 2.1% | |
| ੇ | 4 | 2.1% | |
| ਝ | 4 | 2.1% | |
| ਮ | 4 | 2.1% | |
| ੋ | 4 | 2.1% | |
| ਅ | 2 | 1.0% | |
| Other values (17) | 21 | 10.8% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| ღ | 7 | 70.0% | |
| ყ | 2 | 20.0% | |
| Ⴆ | 1 | 10.0% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ા | 12 | 12.4% | |
| ર | 7 | 7.2% | |
| ી | 7 | 7.2% | |
| ુ | 6 | 6.2% | |
| વ | 6 | 6.2% | |
| ં | 6 | 6.2% | |
| ત | 5 | 5.2% | |
| ે | 5 | 5.2% | |
| મ | 5 | 5.2% | |
| ગ | 4 | 4.1% | |
| જ | 3 | 3.1% | |
| ્ | 3 | 3.1% | |
| બ | 3 | 3.1% | |
| ો | 3 | 3.1% | |
| ક | 3 | 3.1% | |
| છ | 2 | 2.1% | |
| લ | 2 | 2.1% | |
| ચ | 2 | 2.1% | |
| ણ | 2 | 2.1% | |
| હ | 1 | 1.0% | |
| ખ | 1 | 1.0% | |
| થ | 1 | 1.0% | |
| ળ | 1 | 1.0% | |
| િ | 1 | 1.0% | |
| અ | 1 | 1.0% | |
| Other values (5) | 5 | 5.2% |
Most frequent Malayalam characters
| Value | Count | Frequency (%) | |
| ് | 14 | 12.1% | |
| ക | 10 | 8.6% | |
| ന | 10 | 8.6% | |
| ത | 9 | 7.8% | |
| ി | 7 | 6.0% | |
| ാ | 6 | 5.2% | |
| ു | 5 | 4.3% | |
| മ | 4 | 3.4% | |
| ര | 4 | 3.4% | |
| യ | 4 | 3.4% | |
| ശ | 4 | 3.4% | |
| ീ | 3 | 2.6% | |
| െ | 3 | 2.6% | |
| ങ | 3 | 2.6% | |
| ൻ | 3 | 2.6% | |
| പ | 3 | 2.6% | |
| ച | 2 | 1.7% | |
| ഭ | 2 | 1.7% | |
| ൃ | 2 | 1.7% | |
| ൂ | 2 | 1.7% | |
| ർ | 2 | 1.7% | |
| ള | 2 | 1.7% | |
| ൽ | 1 | 0.9% | |
| ൊ | 1 | 0.9% | |
| വ | 1 | 0.9% | |
| Other values (9) | 9 | 7.8% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 52 | 8.7% | |
| ు | 47 | 7.9% | |
| ా | 37 | 6.2% | |
| ి | 36 | 6.0% | |
| న | 34 | 5.7% | |
| ల | 31 | 5.2% | |
| త | 31 | 5.2% | |
| క | 28 | 4.7% | |
| మ | 27 | 4.5% | |
| ర | 27 | 4.5% | |
| వ | 20 | 3.4% | |
| ం | 19 | 3.2% | |
| గ | 15 | 2.5% | |
| స | 15 | 2.5% | |
| ప | 15 | 2.5% | |
| ె | 15 | 2.5% | |
| ట | 13 | 2.2% | |
| ో | 13 | 2.2% | |
| య | 12 | 2.0% | |
| ే | 11 | 1.8% | |
| అ | 10 | 1.7% | |
| చ | 10 | 1.7% | |
| ష | 9 | 1.5% | |
| డ | 9 | 1.5% | |
| భ | 8 | 1.3% | |
| Other values (21) | 52 | 8.7% |
Most frequent Tagalog characters
| Value | Count | Frequency (%) | |
| ᜉ | 1 | 25.0% | |
| ᜂ | 1 | 25.0% | |
| ᜎ | 1 | 25.0% | |
| ᜓ | 1 | 25.0% |
Most frequent Canadian_Aboriginal characters
| Value | Count | Frequency (%) | |
| ᗩ | 23 | 16.1% | |
| ᑎ | 17 | 11.9% | |
| ᔕ | 16 | 11.2% | |
| ᒪ | 15 | 10.5% | |
| ᖇ | 14 | 9.8% | |
| ᗰ | 10 | 7.0% | |
| ᑕ | 9 | 6.3% | |
| ᗪ | 8 | 5.6% | |
| ᐯ | 6 | 4.2% | |
| ᑭ | 5 | 3.5% | |
| ᑌ | 4 | 2.8% | |
| ᗯ | 2 | 1.4% | |
| ᕼ | 2 | 1.4% | |
| ᗷ | 2 | 1.4% | |
| ᗴ | 2 | 1.4% | |
| ᐃ | 2 | 1.4% | |
| ᒍ | 1 | 0.7% | |
| ᖴ | 1 | 0.7% | |
| ᖃ | 1 | 0.7% | |
| ᓗ | 1 | 0.7% | |
| ᑦ | 1 | 0.7% | |
| ᗡ | 1 | 0.7% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| Ⲁ | 2 | 28.6% | |
| Ⲍ | 2 | 28.6% | |
| Ⲧ | 2 | 28.6% | |
| ⲁ | 1 | 14.3% |
Most frequent Tibetan characters
| Value | Count | Frequency (%) | |
| ༒ | 4 | 100.0% |
Most frequent Javanese characters
| Value | Count | Frequency (%) | |
| ꧁ | 2 | 50.0% | |
| ꧂ | 2 | 50.0% |
Most frequent Ol_Chiki characters
| Value | Count | Frequency (%) | |
| ᱛ | 2 | 100.0% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꮹ | 2 | 40.0% | |
| Ᏻ | 1 | 20.0% | |
| Ꮃ | 1 | 20.0% | |
| Ꭹ | 1 | 20.0% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| ա | 8 | 23.5% | |
| կ | 4 | 11.8% | |
| ե | 3 | 8.8% | |
| տ | 2 | 5.9% | |
| ն | 2 | 5.9% | |
| ր | 2 | 5.9% | |
| Ք | 1 | 2.9% | |
| ո | 1 | 2.9% | |
| ղ | 1 | 2.9% | |
| վ | 1 | 2.9% | |
| զ | 1 | 2.9% | |
| ը | 1 | 2.9% | |
| ֍ | 1 | 2.9% | |
| ֎ | 1 | 2.9% | |
| Փ | 1 | 2.9% | |
| ս | 1 | 2.9% | |
| չ | 1 | 2.9% | |
| հ | 1 | 2.9% | |
| ի | 1 | 2.9% |
Most frequent Runic characters
| Value | Count | Frequency (%) | |
| ᛉ | 1 | 50.0% | |
| ᛟ | 1 | 50.0% |
Most frequent Tifinagh characters
| Value | Count | Frequency (%) | |
| ⵣ | 1 | 100.0% |
Most frequent Ethiopic characters
| Value | Count | Frequency (%) | |
| ል | 1 | 16.7% | |
| ማ | 1 | 16.7% | |
| ት | 1 | 16.7% | |
| ፖ | 1 | 16.7% | |
| ሊ | 1 | 16.7% | |
| ሲ | 1 | 16.7% |
Most frequent Lao characters
| Value | Count | Frequency (%) | |
| ໒ | 1 | 50.0% | |
| ຮ | 1 | 50.0% |
Most frequent Yi characters
| Value | Count | Frequency (%) | |
| ꒱ | 1 | 100.0% |
Most frequent Bamum characters
| Value | Count | Frequency (%) | |
| 𖥻 | 1 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4551183 | 97.7% | |
| Devanagari | 24927 | 0.5% | |
| None | 19041 | 0.4% | |
| Punctuation | 11310 | 0.2% | |
| Enclosed Alphanum Sup | 9269 | 0.2% | |
| Arabic | 6896 | 0.1% | |
| Cyrillic | 5747 | 0.1% | |
| VS | 3939 | 0.1% | |
| CJK | 3235 | 0.1% | |
| Math Alphanum | 2807 | 0.1% | |
| Latin 1 Sup | 2470 | 0.1% | |
| Misc Symbols | 2439 | 0.1% | |
| Dingbats | 2248 | < 0.1% | |
| Tamil | 1696 | < 0.1% | |
| Emoticons | 1524 | < 0.1% | |
| Kannada | 1177 | < 0.1% | |
| Tags | 1015 | < 0.1% | |
| Phonetic Ext | 771 | < 0.1% | |
| Hiragana | 732 | < 0.1% | |
| Telugu | 596 | < 0.1% | |
| Katakana | 568 | < 0.1% | |
| Diacriticals | 551 | < 0.1% | |
| Bengali | 546 | < 0.1% | |
| Thai | 545 | < 0.1% | |
| Math Operators | 501 | < 0.1% | |
| Other values (54) | 2968 | 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 632325 | 13.9% | ||
| e | 366512 | 8.1% | |
| a | 275198 | 6.0% | |
| i | 266213 | 5.8% | |
| n | 264026 | 5.8% | |
| o | 261927 | 5.8% | |
| t | 259416 | 5.7% | |
| r | 223533 | 4.9% | |
| s | 211119 | 4.6% | |
| l | 147031 | 3.2% | |
| c | 111552 | 2.5% | |
| d | 110103 | 2.4% | |
| h | 99119 | 2.2% | |
| u | 89833 | 2.0% | |
| m | 78464 | 1.7% | |
| g | 65123 | 1.4% | |
| p | 64489 | 1.4% | |
| . | 64285 | 1.4% | |
| f | 58055 | 1.3% | |
| y | 53920 | 1.2% | |
| , | 51325 | 1.1% | |
| w | 48144 | 1.1% | |
| v | 42251 | 0.9% | |
| b | 37889 | 0.8% | |
| C | 31995 | 0.7% | |
| Other values (71) | 637336 | 14.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| 🌈 | 596 | 3.1% | |
| 🏻 | 471 | 2.5% | |
| 🏳 | 401 | 2.1% | |
| 🚩 | 371 | 1.9% | |
| 💙 | 343 | 1.8% | |
| 👩 | 301 | 1.6% | |
| 🏼 | 288 | 1.5% | |
| 📈 | 287 | 1.5% | |
| 🌲 | 279 | 1.5% | |
| 🌊 | 241 | 1.3% | |
| 👨 | 225 | 1.2% | |
| 🏾 | 196 | 1.0% | |
| 🏴 | 195 | 1.0% | |
| 🏽 | 193 | 1.0% | |
| 💚 | 183 | 1.0% | |
| 🎓 | 169 | 0.9% | |
| 🌴 | 160 | 0.8% | |
| 💜 | 155 | 0.8% | |
| 📚 | 153 | 0.8% | |
| 💻 | 144 | 0.8% | |
| 🚫 | 143 | 0.8% | |
| 💯 | 141 | 0.7% | |
| 🎶 | 140 | 0.7% | |
| 🌎 | 132 | 0.7% | |
| 🔥 | 114 | 0.6% | |
| Other values (878) | 13020 | 68.4% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| – | 3169 | 28.0% | |
| ’ | 2757 | 24.4% | |
| • | 2084 | 18.4% | |
| | 1353 | 12.0% | |
| ” | 457 | 4.0% | |
| “ | 449 | 4.0% | |
| ‘ | 227 | 2.0% | |
| | 205 | 1.8% | |
| — | 199 | 1.8% | |
| … | 193 | 1.7% | |
| | 65 | 0.6% | |
| † | 45 | 0.4% | |
| ‼ | 25 | 0.2% | |
| ― | 19 | 0.2% | |
| | 18 | 0.2% | |
| ‑ | 10 | 0.1% | |
| | 6 | 0.1% | |
| | 6 | 0.1% | |
| ‧ | 5 | < 0.1% | |
| „ | 5 | < 0.1% | |
| ‿ | 4 | < 0.1% | |
| ※ | 3 | < 0.1% | |
| ′ | 2 | < 0.1% | |
| ″ | 2 | < 0.1% | |
| ‐ | 1 | < 0.1% |
Most frequent Misc Symbols characters
| Value | Count | Frequency (%) | |
| ♀ | 254 | 10.4% | |
| ☀ | 173 | 7.1% | |
| ♥ | 171 | 7.0% | |
| ⚕ | 171 | 7.0% | |
| ♂ | 139 | 5.7% | |
| ⚽ | 109 | 4.5% | |
| ☕ | 102 | 4.2% | |
| ⚡ | 98 | 4.0% | |
| ⚖ | 74 | 3.0% | |
| ♡ | 69 | 2.8% | |
| ☮ | 66 | 2.7% | |
| ⛔ | 49 | 2.0% | |
| ⚾ | 43 | 1.8% | |
| ♻ | 37 | 1.5% | |
| ★ | 33 | 1.4% | |
| ☺ | 32 | 1.3% | |
| ♟ | 31 | 1.3% | |
| ☯ | 30 | 1.2% | |
| ⚧ | 29 | 1.2% | |
| ☠ | 28 | 1.1% | |
| ⚜ | 27 | 1.1% | |
| ☆ | 27 | 1.1% | |
| ⚠ | 24 | 1.0% | |
| ♏ | 24 | 1.0% | |
| ♬ | 23 | 0.9% | |
| Other values (80) | 576 | 23.6% |
Most frequent VS characters
| Value | Count | Frequency (%) | |
| ️ | 3888 | 98.7% | |
| ︎ | 51 | 1.3% |
Most frequent Dingbats characters
| Value | Count | Frequency (%) | |
| ❤ | 840 | 37.4% | |
| ✨ | 216 | 9.6% | |
| ✊ | 178 | 7.9% | |
| ✈ | 177 | 7.9% | |
| ✌ | 154 | 6.9% | |
| ➡ | 123 | 5.5% | |
| ✍ | 122 | 5.4% | |
| ✝ | 85 | 3.8% | |
| ✋ | 59 | 2.6% | |
| ❌ | 52 | 2.3% | |
| ❄ | 46 | 2.0% | |
| ❣ | 29 | 1.3% | |
| ✡ | 20 | 0.9% | |
| ✉ | 17 | 0.8% | |
| ❗ | 15 | 0.7% | |
| ✏ | 14 | 0.6% | |
| ✖ | 11 | 0.5% | |
| ❇ | 9 | 0.4% | |
| ✺ | 9 | 0.4% | |
| ✧ | 8 | 0.4% | |
| ✒ | 6 | 0.3% | |
| ✴ | 6 | 0.3% | |
| ➕ | 5 | 0.2% | |
| ➧ | 4 | 0.2% | |
| ✶ | 4 | 0.2% | |
| Other values (22) | 39 | 1.7% |
Most frequent Diacriticals characters
| Value | Count | Frequency (%) | |
| ͟ | 481 | 87.3% | |
| ̶ | 13 | 2.4% | |
| ̞ | 10 | 1.8% | |
| ̈ | 8 | 1.5% | |
| ͌ | 4 | 0.7% | |
| ̽ | 4 | 0.7% | |
| ́ | 2 | 0.4% | |
| ͛ | 2 | 0.4% | |
| ͖ | 2 | 0.4% | |
| ̆ | 2 | 0.4% | |
| ̂ | 2 | 0.4% | |
| ͚ | 2 | 0.4% | |
| ̟ | 2 | 0.4% | |
| ͎ | 2 | 0.4% | |
| ̾ | 2 | 0.4% | |
| ͜ | 2 | 0.4% | |
| ͡ | 2 | 0.4% | |
| ̵ | 2 | 0.4% | |
| ̅ | 1 | 0.2% | |
| ̡ | 1 | 0.2% | |
| ̨ | 1 | 0.2% | |
| ̄ | 1 | 0.2% | |
| ͥ | 1 | 0.2% | |
| ͣ | 1 | 0.2% | |
| ͫ | 1 | 0.2% |
Most frequent Enclosed Alphanum Sup characters
| Value | Count | Frequency (%) | |
| 🇺 | 1144 | 12.3% | |
| 🇮 | 940 | 10.1% | |
| 🇳 | 865 | 9.3% | |
| 🇸 | 817 | 8.8% | |
| 🇪 | 663 | 7.2% | |
| 🇦 | 533 | 5.8% | |
| 🇧 | 489 | 5.3% | |
| 🇨 | 469 | 5.1% | |
| 🇬 | 447 | 4.8% | |
| 🇷 | 428 | 4.6% | |
| 🇵 | 325 | 3.5% | |
| 🇰 | 291 | 3.1% | |
| 🇹 | 255 | 2.8% | |
| 🇭 | 251 | 2.7% | |
| 🇲 | 246 | 2.7% | |
| 🇱 | 225 | 2.4% | |
| 🇩 | 148 | 1.6% | |
| 🇿 | 131 | 1.4% | |
| 🇫 | 104 | 1.1% | |
| 🇴 | 75 | 0.8% | |
| 🇼 | 74 | 0.8% | |
| 🇾 | 68 | 0.7% | |
| 🇻 | 58 | 0.6% | |
| 🇽 | 51 | 0.6% | |
| 🇯 | 44 | 0.5% | |
| Other values (43) | 128 | 1.4% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| é | 549 | 22.2% | |
| í | 211 | 8.5% | |
| 171 | 6.9% | ||
| ó | 165 | 6.7% | |
| á | 150 | 6.1% | |
| ® | 148 | 6.0% | |
| ñ | 119 | 4.8% | |
| ä | 85 | 3.4% | |
| ü | 68 | 2.8% | |
| è | 60 | 2.4% | |
| à | 58 | 2.3% | |
| | 52 | 2.1% | |
| ö | 46 | 1.9% | |
| ¦ | 45 | 1.8% | |
| ç | 44 | 1.8% | |
| ° | 40 | 1.6% | |
| ê | 38 | 1.5% | |
| ú | 34 | 1.4% | |
| ¡ | 28 | 1.1% | |
| Ü | 23 | 0.9% | |
| © | 22 | 0.9% | |
| · | 22 | 0.9% | |
| â | 20 | 0.8% | |
| Ö | 20 | 0.8% | |
| « | 16 | 0.6% | |
| Other values (47) | 236 | 9.6% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 995 | 14.4% | |
| ل | 659 | 9.6% | |
| ي | 426 | 6.2% | |
| م | 402 | 5.8% | |
| ن | 381 | 5.5% | |
| ر | 376 | 5.5% | |
| و | 365 | 5.3% | |
| ت | 256 | 3.7% | |
| ب | 223 | 3.2% | |
| ع | 217 | 3.1% | |
| س | 214 | 3.1% | |
| د | 214 | 3.1% | |
| ة | 191 | 2.8% | |
| ف | 127 | 1.8% | |
| ح | 126 | 1.8% | |
| ه | 124 | 1.8% | |
| ق | 111 | 1.6% | |
| ك | 108 | 1.6% | |
| ی | 106 | 1.5% | |
| ش | 81 | 1.2% | |
| خ | 80 | 1.2% | |
| َ | 74 | 1.1% | |
| ج | 71 | 1.0% | |
| أ | 71 | 1.0% | |
| ط | 69 | 1.0% | |
| Other values (41) | 829 | 12.0% |
Most frequent Geometric Shapes characters
| Value | Count | Frequency (%) | |
| ▫ | 137 | 32.4% | |
| ▪ | 134 | 31.7% | |
| ● | 73 | 17.3% | |
| ▶ | 14 | 3.3% | |
| ► | 12 | 2.8% | |
| ◾ | 11 | 2.6% | |
| ▸ | 10 | 2.4% | |
| ○ | 7 | 1.7% | |
| ◆ | 6 | 1.4% | |
| ◈ | 4 | 0.9% | |
| ◔ | 4 | 0.9% | |
| ◘ | 4 | 0.9% | |
| ■ | 3 | 0.7% | |
| △ | 1 | 0.2% | |
| ◻ | 1 | 0.2% | |
| ◼ | 1 | 0.2% | |
| ◄ | 1 | 0.2% |
Most frequent Tags characters
| Value | Count | Frequency (%) | |
| | 226 | 22.3% | |
| | 170 | 16.7% | |
| | 168 | 16.6% | |
| | 113 | 11.1% | |
| | 71 | 7.0% | |
| | 70 | 6.9% | |
| | 56 | 5.5% | |
| | 56 | 5.5% | |
| | 43 | 4.2% | |
| | 42 | 4.1% |
Most frequent Emoticons characters
| Value | Count | Frequency (%) | |
| 🙏 | 379 | 24.9% | |
| 😷 | 162 | 10.6% | |
| 😎 | 141 | 9.3% | |
| 😊 | 90 | 5.9% | |
| 😍 | 62 | 4.1% | |
| 😉 | 54 | 3.5% | |
| 😇 | 53 | 3.5% | |
| 🙌 | 45 | 3.0% | |
| 😂 | 44 | 2.9% | |
| 😀 | 43 | 2.8% | |
| 😁 | 41 | 2.7% | |
| 🙂 | 29 | 1.9% | |
| 😜 | 28 | 1.8% | |
| 🙃 | 23 | 1.5% | |
| 😘 | 20 | 1.3% | |
| 😏 | 20 | 1.3% | |
| 😸 | 19 | 1.2% | |
| 😄 | 19 | 1.2% | |
| 🙈 | 17 | 1.1% | |
| 😈 | 16 | 1.0% | |
| 😠 | 15 | 1.0% | |
| 😻 | 14 | 0.9% | |
| 😅 | 13 | 0.9% | |
| 😺 | 10 | 0.7% | |
| 😋 | 10 | 0.7% | |
| Other values (40) | 157 | 10.3% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| ा | 2061 | 8.3% | |
| र | 1902 | 7.6% | |
| ् | 1886 | 7.6% | |
| क | 1211 | 4.9% | |
| म | 1186 | 4.8% | |
| त | 1173 | 4.7% | |
| ि | 1108 | 4.4% | |
| े | 1078 | 4.3% | |
| स | 947 | 3.8% | |
| ह | 903 | 3.6% | |
| न | 858 | 3.4% | |
| ी | 741 | 3.0% | |
| ं | 709 | 2.8% | |
| व | 698 | 2.8% | |
| य | 682 | 2.7% | |
| द | 624 | 2.5% | |
| ो | 579 | 2.3% | |
| ज | 478 | 1.9% | |
| ु | 464 | 1.9% | |
| ल | 436 | 1.7% | |
| प | 426 | 1.7% | |
| । | 378 | 1.5% | |
| भ | 354 | 1.4% | |
| ष | 341 | 1.4% | |
| ग | 326 | 1.3% | |
| Other values (49) | 3378 | 13.6% |
Most frequent Math Operators characters
| Value | Count | Frequency (%) | |
| ≠ | 447 | 89.2% | |
| ∆ | 14 | 2.8% | |
| ∂ | 13 | 2.6% | |
| ≫ | 6 | 1.2% | |
| ∙ | 6 | 1.2% | |
| ∞ | 5 | 1.0% | |
| ≈ | 3 | 0.6% | |
| ⋆ | 2 | 0.4% | |
| ∣ | 2 | 0.4% | |
| ∈ | 1 | 0.2% | |
| ⋂ | 1 | 0.2% | |
| − | 1 | 0.2% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| и | 532 | 9.3% | |
| с | 473 | 8.2% | |
| о | 443 | 7.7% | |
| а | 422 | 7.3% | |
| е | 333 | 5.8% | |
| н | 314 | 5.5% | |
| р | 252 | 4.4% | |
| т | 249 | 4.3% | |
| к | 242 | 4.2% | |
| л | 220 | 3.8% | |
| в | 212 | 3.7% | |
| у | 153 | 2.7% | |
| д | 152 | 2.6% | |
| м | 126 | 2.2% | |
| ь | 120 | 2.1% | |
| я | 111 | 1.9% | |
| п | 97 | 1.7% | |
| з | 96 | 1.7% | |
| й | 95 | 1.7% | |
| П | 89 | 1.5% | |
| ы | 87 | 1.5% | |
| х | 82 | 1.4% | |
| Р | 72 | 1.3% | |
| ю | 69 | 1.2% | |
| ц | 61 | 1.1% | |
| Other values (42) | 645 | 11.2% |
Most frequent Math Alphanum characters
| Value | Count | Frequency (%) | |
| 𝐞 | 73 | 2.6% | |
| 𝕖 | 66 | 2.4% | |
| 𝐧 | 52 | 1.9% | |
| 𝐢 | 48 | 1.7% | |
| 𝐚 | 47 | 1.7% | |
| 𝕣 | 46 | 1.6% | |
| 𝕒 | 45 | 1.6% | |
| 𝕥 | 44 | 1.6% | |
| 𝐭 | 44 | 1.6% | |
| 𝐫 | 42 | 1.5% | |
| 𝓉 | 38 | 1.4% | |
| 𝒾 | 37 | 1.3% | |
| 𝐨 | 35 | 1.2% | |
| 𝕠 | 32 | 1.1% | |
| 𝐬 | 31 | 1.1% | |
| 𝕚 | 30 | 1.1% | |
| 𝕤 | 29 | 1.0% | |
| 𝑒 | 28 | 1.0% | |
| 𝒆 | 28 | 1.0% | |
| 𝕟 | 24 | 0.9% | |
| 𝒕 | 24 | 0.9% | |
| 𝓮 | 24 | 0.9% | |
| 𝓃 | 23 | 0.8% | |
| 𝐡 | 23 | 0.8% | |
| 𝒂 | 22 | 0.8% | |
| Other values (350) | 1872 | 66.7% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ー | 86 | 15.1% | |
| イ | 53 | 9.3% | |
| ン | 33 | 5.8% | |
| ド | 29 | 5.1% | |
| ス | 28 | 4.9% | |
| ツ | 28 | 4.9% | |
| ・ | 27 | 4.8% | |
| ッ | 20 | 3.5% | |
| バ | 18 | 3.2% | |
| フ | 15 | 2.6% | |
| コ | 14 | 2.5% | |
| タ | 14 | 2.5% | |
| ォ | 14 | 2.5% | |
| ロ | 13 | 2.3% | |
| ビ | 13 | 2.3% | |
| ア | 12 | 2.1% | |
| オ | 12 | 2.1% | |
| サ | 11 | 1.9% | |
| ト | 11 | 1.9% | |
| ク | 8 | 1.4% | |
| シ | 7 | 1.2% | |
| カ | 7 | 1.2% | |
| ヌ | 7 | 1.2% | |
| メ | 6 | 1.1% | |
| ャ | 6 | 1.1% | |
| Other values (29) | 76 | 13.4% |
Most frequent Misc Technical characters
| Value | Count | Frequency (%) | |
| ⏳ | 21 | 47.7% | |
| ⏩ | 13 | 29.5% | |
| ⏰ | 3 | 6.8% | |
| ⌨ | 2 | 4.5% | |
| ⏱ | 2 | 4.5% | |
| ⌬ | 2 | 4.5% | |
| ⏬ | 1 | 2.3% |
Most frequent Latin Ext A characters
| Value | Count | Frequency (%) | |
| ı | 48 | 20.3% | |
| ř | 26 | 11.0% | |
| š | 25 | 10.6% | |
| İ | 18 | 7.6% | |
| ě | 15 | 6.4% | |
| ā | 14 | 5.9% | |
| ğ | 8 | 3.4% | |
| ł | 8 | 3.4% | |
| ş | 6 | 2.5% | |
| ż | 6 | 2.5% | |
| ą | 6 | 2.5% | |
| ę | 6 | 2.5% | |
| đ | 6 | 2.5% | |
| ů | 5 | 2.1% | |
| ō | 5 | 2.1% | |
| č | 5 | 2.1% | |
| ī | 4 | 1.7% | |
| ž | 4 | 1.7% | |
| ă | 4 | 1.7% | |
| ś | 4 | 1.7% | |
| ć | 2 | 0.8% | |
| ū | 2 | 0.8% | |
| ő | 1 | 0.4% | |
| ľ | 1 | 0.4% | |
| Č | 1 | 0.4% | |
| Other values (6) | 6 | 2.5% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 研 | 60 | 1.9% | |
| 人 | 58 | 1.8% | |
| 新 | 57 | 1.8% | |
| 学 | 56 | 1.7% | |
| 国 | 53 | 1.6% | |
| 究 | 49 | 1.5% | |
| 大 | 44 | 1.4% | |
| 中 | 42 | 1.3% | |
| 港 | 39 | 1.2% | |
| 師 | 34 | 1.1% | |
| 実 | 33 | 1.0% | |
| 京 | 31 | 1.0% | |
| 日 | 30 | 0.9% | |
| 立 | 29 | 0.9% | |
| 我 | 29 | 0.9% | |
| 香 | 29 | 0.9% | |
| 命 | 27 | 0.8% | |
| 本 | 27 | 0.8% | |
| 語 | 26 | 0.8% | |
| 革 | 25 | 0.8% | |
| 民 | 25 | 0.8% | |
| 和 | 25 | 0.8% | |
| 报 | 24 | 0.7% | |
| 共 | 24 | 0.7% | |
| 主 | 24 | 0.7% | |
| Other values (547) | 2335 | 72.2% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| の | 56 | 7.7% | |
| て | 46 | 6.3% | |
| で | 45 | 6.1% | |
| は | 44 | 6.0% | |
| し | 40 | 5.5% | |
| い | 40 | 5.5% | |
| た | 39 | 5.3% | |
| を | 35 | 4.8% | |
| り | 34 | 4.6% | |
| っ | 32 | 4.4% | |
| な | 29 | 4.0% | |
| と | 22 | 3.0% | |
| る | 22 | 3.0% | |
| す | 20 | 2.7% | |
| に | 19 | 2.6% | |
| せ | 17 | 2.3% | |
| き | 16 | 2.2% | |
| あ | 15 | 2.0% | |
| ら | 14 | 1.9% | |
| ま | 14 | 1.9% | |
| ね | 11 | 1.5% | |
| が | 11 | 1.5% | |
| く | 11 | 1.5% | |
| れ | 8 | 1.1% | |
| も | 8 | 1.1% | |
| Other values (26) | 84 | 11.5% |
Most frequent Phonetic Ext characters
| Value | Count | Frequency (%) | |
| ᴇ | 124 | 16.1% | |
| ᴀ | 78 | 10.1% | |
| ᴏ | 75 | 9.7% | |
| ᴛ | 70 | 9.1% | |
| ᴅ | 42 | 5.4% | |
| ᴄ | 41 | 5.3% | |
| ᵃ | 38 | 4.9% | |
| ᵉ | 34 | 4.4% | |
| ᴍ | 34 | 4.4% | |
| ᴜ | 31 | 4.0% | |
| ᵗ | 26 | 3.4% | |
| ᴘ | 26 | 3.4% | |
| ᵈ | 18 | 2.3% | |
| ᵘ | 17 | 2.2% | |
| ᵒ | 16 | 2.1% | |
| ᴠ | 15 | 1.9% | |
| ᴡ | 15 | 1.9% | |
| ᴋ | 13 | 1.7% | |
| ᵏ | 10 | 1.3% | |
| ᵐ | 7 | 0.9% | |
| ᵛ | 6 | 0.8% | |
| ᵖ | 5 | 0.6% | |
| ᴊ | 4 | 0.5% | |
| ᵇ | 4 | 0.5% | |
| ᴾ | 3 | 0.4% | |
| Other values (14) | 19 | 2.5% |
Most frequent Phonetic Ext Sup characters
| Value | Count | Frequency (%) | |
| ᶜ | 12 | 50.0% | |
| ᶠ | 9 | 37.5% | |
| ᶰ | 2 | 8.3% | |
| ᶤ | 1 | 4.2% |
Most frequent Modifier Letters characters
| Value | Count | Frequency (%) | |
| ʳ | 19 | 25.3% | |
| ˢ | 19 | 25.3% | |
| ʰ | 14 | 18.7% | |
| ˡ | 12 | 16.0% | |
| ʲ | 4 | 5.3% | |
| ʸ | 2 | 2.7% | |
| ː | 2 | 2.7% | |
| ʷ | 2 | 2.7% | |
| ʻ | 1 | 1.3% |
Most frequent Enclosed Alphanum characters
| Value | Count | Frequency (%) | |
| ⓔ | 14 | 16.3% | |
| ⓣ | 9 | 10.5% | |
| Ⓐ | 7 | 8.1% | |
| ⓢ | 6 | 7.0% | |
| ⓝ | 6 | 7.0% | |
| Ⓜ | 4 | 4.7% | |
| ⓡ | 4 | 4.7% | |
| ⓞ | 4 | 4.7% | |
| Ⓒ | 3 | 3.5% | |
| Ⓡ | 3 | 3.5% | |
| Ⓔ | 3 | 3.5% | |
| Ⓝ | 3 | 3.5% | |
| ⓐ | 3 | 3.5% | |
| Ⓥ | 3 | 3.5% | |
| Ⓣ | 2 | 2.3% | |
| ⓦ | 2 | 2.3% | |
| ⓓ | 2 | 2.3% | |
| ⓜ | 2 | 2.3% | |
| ⓘ | 1 | 1.2% | |
| Ⓞ | 1 | 1.2% | |
| Ⓤ | 1 | 1.2% | |
| Ⓘ | 1 | 1.2% | |
| Ⓨ | 1 | 1.2% | |
| Ⓖ | 1 | 1.2% |
Most frequent Letterlike Symbols characters
| Value | Count | Frequency (%) | |
| ™ | 37 | 45.7% | |
| ℓ | 17 | 21.0% | |
| ℯ | 7 | 8.6% | |
| ℴ | 5 | 6.2% | |
| ℐ | 3 | 3.7% | |
| ℍ | 3 | 3.7% | |
| ℂ | 2 | 2.5% | |
| ℝ | 2 | 2.5% | |
| ℃ | 2 | 2.5% | |
| ℑ | 1 | 1.2% | |
| ℙ | 1 | 1.2% | |
| ℭ | 1 | 1.2% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 리 | 3 | 3.5% | |
| 뉴 | 3 | 3.5% | |
| 스 | 3 | 3.5% | |
| 신 | 2 | 2.4% | |
| 의 | 2 | 2.4% | |
| 어 | 2 | 2.4% | |
| 트 | 2 | 2.4% | |
| 기 | 2 | 2.4% | |
| 아 | 2 | 2.4% | |
| 이 | 2 | 2.4% | |
| 평 | 1 | 1.2% | |
| 일 | 1 | 1.2% | |
| 오 | 1 | 1.2% | |
| 전 | 1 | 1.2% | |
| 헨 | 1 | 1.2% | |
| 영 | 1 | 1.2% | |
| 시 | 1 | 1.2% | |
| 사 | 1 | 1.2% | |
| 팩 | 1 | 1.2% | |
| 체 | 1 | 1.2% | |
| 크 | 1 | 1.2% | |
| 한 | 1 | 1.2% | |
| 국 | 1 | 1.2% | |
| 담 | 1 | 1.2% | |
| 당 | 1 | 1.2% | |
| Other values (47) | 47 | 55.3% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 285 | 16.8% | |
| ம | 164 | 9.7% | |
| ி | 150 | 8.8% | |
| த | 141 | 8.3% | |
| க | 97 | 5.7% | |
| ல | 92 | 5.4% | |
| ன | 76 | 4.5% | |
| ழ | 73 | 4.3% | |
| ு | 69 | 4.1% | |
| ா | 65 | 3.8% | |
| ப | 48 | 2.8% | |
| ந | 45 | 2.7% | |
| ர | 44 | 2.6% | |
| வ | 44 | 2.6% | |
| ய | 36 | 2.1% | |
| ட | 33 | 1.9% | |
| ச | 33 | 1.9% | |
| இ | 29 | 1.7% | |
| ே | 27 | 1.6% | |
| ை | 24 | 1.4% | |
| ோ | 17 | 1.0% | |
| ற | 14 | 0.8% | |
| ண | 14 | 0.8% | |
| ெ | 12 | 0.7% | |
| உ | 11 | 0.6% | |
| Other values (14) | 53 | 3.1% |
Most frequent PUA characters
| Value | Count | Frequency (%) | |
| | 19 | 76.0% | |
| | 3 | 12.0% | |
| | 3 | 12.0% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 40 | 100.0% |
Most frequent IPA Ext characters
| Value | Count | Frequency (%) | |
| ɪ | 85 | 20.0% | |
| ʀ | 78 | 18.4% | |
| ɴ | 70 | 16.5% | |
| ʟ | 58 | 13.7% | |
| ʜ | 28 | 6.6% | |
| ʏ | 23 | 5.4% | |
| ɢ | 21 | 5.0% | |
| ʙ | 19 | 4.5% | |
| ʇ | 7 | 1.7% | |
| ɹ | 5 | 1.2% | |
| ɟ | 4 | 0.9% | |
| ɔ | 3 | 0.7% | |
| ʌ | 3 | 0.7% | |
| ʎ | 3 | 0.7% | |
| ɥ | 3 | 0.7% | |
| ɾ | 3 | 0.7% | |
| ɑ | 2 | 0.5% | |
| ʃ | 1 | 0.2% | |
| ɐ | 1 | 0.2% | |
| ɯ | 1 | 0.2% | |
| ɱ | 1 | 0.2% | |
| ɛ | 1 | 0.2% | |
| ɫ | 1 | 0.2% | |
| ʋ | 1 | 0.2% | |
| ʂ | 1 | 0.2% |
Most frequent Latin Ext D characters
| Value | Count | Frequency (%) | |
| ꜱ | 42 | 72.4% | |
| ꜰ | 14 | 24.1% | |
| ꟼ | 2 | 3.4% |
Most frequent Arrows characters
| Value | Count | Frequency (%) | |
| ↓ | 13 | 31.7% | |
| → | 7 | 17.1% | |
| ⇔ | 3 | 7.3% | |
| ← | 3 | 7.3% | |
| ↑ | 2 | 4.9% | |
| ↺ | 2 | 4.9% | |
| ⇨ | 2 | 4.9% | |
| ↗ | 2 | 4.9% | |
| ⇒ | 2 | 4.9% | |
| ↩ | 1 | 2.4% | |
| ↪ | 1 | 2.4% | |
| ↘ | 1 | 2.4% | |
| ↔ | 1 | 2.4% | |
| ↝ | 1 | 2.4% |
Most frequent Egyptian Hieroglyphs characters
| Value | Count | Frequency (%) | |
| 𓅓 | 1 | 33.3% | |
| 𓆉 | 1 | 33.3% | |
| 𓇽 | 1 | 33.3% |
Most frequent Old Turkic characters
| Value | Count | Frequency (%) | |
| 𐰀 | 3 | 15.8% | |
| 𐰤 | 2 | 10.5% | |
| 𐰢 | 2 | 10.5% | |
| 𐰆 | 2 | 10.5% | |
| 𐰇 | 2 | 10.5% | |
| 𐱃 | 1 | 5.3% | |
| 𐰞 | 1 | 5.3% | |
| 𐱅 | 1 | 5.3% | |
| 𐰼 | 1 | 5.3% | |
| 𐰚 | 1 | 5.3% | |
| 𐰓 | 1 | 5.3% | |
| 𐰃 | 1 | 5.3% | |
| 𐰘 | 1 | 5.3% |
Most frequent Oriya characters
| Value | Count | Frequency (%) | |
| ି | 10 | 9.2% | |
| ା | 8 | 7.3% | |
| ଆ | 7 | 6.4% | |
| ୍ | 7 | 6.4% | |
| ପ | 6 | 5.5% | |
| ର | 6 | 5.5% | |
| ଲ | 4 | 3.7% | |
| ଓ | 4 | 3.7% | |
| ଡ | 4 | 3.7% | |
| ଼ | 4 | 3.7% | |
| ନ | 4 | 3.7% | |
| ସ | 3 | 2.8% | |
| ଖ | 3 | 2.8% | |
| ୀ | 3 | 2.8% | |
| ଣ | 3 | 2.8% | |
| ମ | 3 | 2.8% | |
| ବ | 3 | 2.8% | |
| ତ | 2 | 1.8% | |
| େ | 2 | 1.8% | |
| କ | 2 | 1.8% | |
| ଶ | 2 | 1.8% | |
| ୱ | 2 | 1.8% | |
| ୁ | 2 | 1.8% | |
| ଦ | 2 | 1.8% | |
| ଷ | 2 | 1.8% | |
| Other values (11) | 11 | 10.1% |
Most frequent Currency Symbols characters
| Value | Count | Frequency (%) | |
| € | 21 | 87.5% | |
| ₿ | 2 | 8.3% | |
| ₩ | 1 | 4.2% |
Most frequent Box Drawing characters
| Value | Count | Frequency (%) | |
| │ | 44 | 77.2% | |
| ┋ | 5 | 8.8% | |
| ┇ | 3 | 5.3% | |
| ┌ | 2 | 3.5% | |
| ┐ | 2 | 3.5% | |
| ║ | 1 | 1.8% |
Most frequent Bengali characters
| Value | Count | Frequency (%) | |
| া | 79 | 14.5% | |
| ি | 39 | 7.1% | |
| র | 37 | 6.8% | |
| ব | 34 | 6.2% | |
| ্ | 25 | 4.6% | |
| ন | 25 | 4.6% | |
| ত | 22 | 4.0% | |
| স | 19 | 3.5% | |
| দ | 18 | 3.3% | |
| ে | 18 | 3.3% | |
| ক | 18 | 3.3% | |
| প | 14 | 2.6% | |
| হ | 14 | 2.6% | |
| ল | 14 | 2.6% | |
| জ | 13 | 2.4% | |
| ু | 12 | 2.2% | |
| ী | 11 | 2.0% | |
| ই | 11 | 2.0% | |
| অ | 10 | 1.8% | |
| য় | 9 | 1.6% | |
| ভ | 9 | 1.6% | |
| ঙ | 8 | 1.5% | |
| ম | 8 | 1.5% | |
| উ | 7 | 1.3% | |
| য | 7 | 1.3% | |
| Other values (20) | 65 | 11.9% |
Most frequent Cuneiform characters
| Value | Count | Frequency (%) | |
| 𒀭 | 2 | 100.0% |
Most frequent Sinhala characters
| Value | Count | Frequency (%) | |
| ි | 12 | 11.8% | |
| ව | 8 | 7.8% | |
| ් | 7 | 6.9% | |
| ු | 6 | 5.9% | |
| ෙ | 6 | 5.9% | |
| ය | 6 | 5.9% | |
| ල | 5 | 4.9% | |
| න | 5 | 4.9% | |
| ප | 4 | 3.9% | |
| ක | 4 | 3.9% | |
| ම | 4 | 3.9% | |
| ා | 3 | 2.9% | |
| ද | 3 | 2.9% | |
| හ | 3 | 2.9% | |
| ඩ | 3 | 2.9% | |
| ස | 2 | 2.0% | |
| බ | 2 | 2.0% | |
| ර | 2 | 2.0% | |
| ඒ | 2 | 2.0% | |
| උ | 2 | 2.0% | |
| ඟ | 2 | 2.0% | |
| ො | 1 | 1.0% | |
| ට | 1 | 1.0% | |
| ත | 1 | 1.0% | |
| එ | 1 | 1.0% | |
| Other values (7) | 7 | 6.9% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| า | 39 | 7.2% | |
| ม | 37 | 6.8% | |
| ร | 32 | 5.9% | |
| น | 28 | 5.1% | |
| ่ | 24 | 4.4% | |
| อ | 21 | 3.9% | |
| ก | 20 | 3.7% | |
| ั | 18 | 3.3% | |
| ค | 18 | 3.3% | |
| เ | 18 | 3.3% | |
| ย | 17 | 3.1% | |
| ไ | 16 | 2.9% | |
| ิ | 14 | 2.6% | |
| ้ | 14 | 2.6% | |
| ี | 12 | 2.2% | |
| ต | 12 | 2.2% | |
| ง | 12 | 2.2% | |
| ์ | 11 | 2.0% | |
| ว | 11 | 2.0% | |
| ท | 11 | 2.0% | |
| ็ | 11 | 2.0% | |
| บ | 11 | 2.0% | |
| ภ | 10 | 1.8% | |
| ป | 9 | 1.7% | |
| ล | 9 | 1.7% | |
| Other values (28) | 110 | 20.2% |
Most frequent Sup Arrows B characters
| Value | Count | Frequency (%) | |
| ⤵ | 13 | 100.0% |
Most frequent Number Forms characters
| Value | Count | Frequency (%) | |
| ⅜ | 1 | 100.0% |
Most frequent Greek Ext characters
| Value | Count | Frequency (%) | |
| ῶ | 3 | 23.1% | |
| Ἐ | 1 | 7.7% | |
| ἴ | 1 | 7.7% | |
| ῳ | 1 | 7.7% | |
| ἀ | 1 | 7.7% | |
| ὼ | 1 | 7.7% | |
| ῥ | 1 | 7.7% | |
| ῖ | 1 | 7.7% | |
| Ὁ | 1 | 7.7% | |
| ἡ | 1 | 7.7% | |
| ὲ | 1 | 7.7% |
Most frequent Block Elements characters
| Value | Count | Frequency (%) | |
| ▌ | 14 | 46.7% | |
| █ | 13 | 43.3% | |
| ▒ | 3 | 10.0% |
Most frequent Latin Ext B characters
| Value | Count | Frequency (%) | |
| ǝ | 12 | 36.4% | |
| ș | 6 | 18.2% | |
| ț | 3 | 9.1% | |
| Ƹ | 3 | 9.1% | |
| Ʒ | 3 | 9.1% | |
| Ɔ | 2 | 6.1% | |
| ư | 2 | 6.1% | |
| ƈ | 1 | 3.0% | |
| ƒ | 1 | 3.0% |
Most frequent Playing Cards characters
| Value | Count | Frequency (%) | |
| 🃏 | 3 | 100.0% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
|  | 6 | 75.0% | |
| � | 2 | 25.0% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ನ | 107 | 9.1% | |
| ್ | 96 | 8.2% | |
| ಿ | 90 | 7.6% | |
| ಕ | 80 | 6.8% | |
| ರ | 80 | 6.8% | |
| ಾ | 68 | 5.8% | |
| ು | 53 | 4.5% | |
| ಮ | 50 | 4.2% | |
| ಯ | 48 | 4.1% | |
| ಗ | 44 | 3.7% | |
| ದ | 43 | 3.7% | |
| ತ | 41 | 3.5% | |
| ೆ | 37 | 3.1% | |
| ಡ | 28 | 2.4% | |
| ಂ | 27 | 2.3% | |
| ಜ | 27 | 2.3% | |
| ಳ | 25 | 2.1% | |
| ೇ | 23 | 2.0% | |
| ಹ | 21 | 1.8% | |
| ಪ | 19 | 1.6% | |
| ವ | 19 | 1.6% | |
| ಟ | 17 | 1.4% | |
| ೂ | 14 | 1.2% | |
| ಅ | 14 | 1.2% | |
| ಚ | 13 | 1.1% | |
| Other values (18) | 93 | 7.9% |
Most frequent Hebrew characters
| Value | Count | Frequency (%) | |
| א | 22 | 11.1% | |
| ו | 18 | 9.1% | |
| י | 16 | 8.1% | |
| ל | 16 | 8.1% | |
| ר | 15 | 7.6% | |
| ש | 13 | 6.6% | |
| ת | 9 | 4.5% | |
| ה | 8 | 4.0% | |
| ְ | 8 | 4.0% | |
| ָ | 8 | 4.0% | |
| ע | 6 | 3.0% | |
| מ | 6 | 3.0% | |
| ב | 6 | 3.0% | |
| ד | 5 | 2.5% | |
| ג | 5 | 2.5% | |
| ֶ | 5 | 2.5% | |
| ך | 4 | 2.0% | |
| כ | 3 | 1.5% | |
| ם | 3 | 1.5% | |
| ּ | 3 | 1.5% | |
| ַ | 3 | 1.5% | |
| נ | 2 | 1.0% | |
| ח | 2 | 1.0% | |
| צ | 2 | 1.0% | |
| ף | 2 | 1.0% | |
| Other values (6) | 8 | 4.0% |
Most frequent Geometric Shapes Ext characters
| Value | Count | Frequency (%) | |
| 🟢 | 6 | 27.3% | |
| 🟥 | 5 | 22.7% | |
| 🟩 | 4 | 18.2% | |
| 🟨 | 3 | 13.6% | |
| 🟧 | 2 | 9.1% | |
| 🟡 | 1 | 4.5% | |
| 🟣 | 1 | 4.5% |
Most frequent Gurmukhi characters
| Value | Count | Frequency (%) | |
| ਾ | 20 | 10.3% | |
| ੀ | 16 | 8.2% | |
| ਸ | 14 | 7.2% | |
| ਂ | 9 | 4.6% | |
| ਕ | 9 | 4.6% | |
| ਰ | 8 | 4.1% | |
| ਜ | 7 | 3.6% | |
| ਬ | 7 | 3.6% | |
| ਤ | 7 | 3.6% | |
| ਦ | 7 | 3.6% | |
| ਵ | 7 | 3.6% | |
| ੁ | 6 | 3.1% | |
| ਨ | 6 | 3.1% | |
| ਹ | 6 | 3.1% | |
| ਗ | 5 | 2.6% | |
| ਿ | 5 | 2.6% | |
| ਼ | 5 | 2.6% | |
| ਲ | 4 | 2.1% | |
| ਪ | 4 | 2.1% | |
| ੰ | 4 | 2.1% | |
| ੇ | 4 | 2.1% | |
| ਝ | 4 | 2.1% | |
| ਮ | 4 | 2.1% | |
| ੋ | 4 | 2.1% | |
| ਅ | 2 | 1.0% | |
| Other values (17) | 21 | 10.8% |
Most frequent Georgian characters
| Value | Count | Frequency (%) | |
| ღ | 7 | 70.0% | |
| ყ | 2 | 20.0% | |
| Ⴆ | 1 | 10.0% |
Most frequent Gujarati characters
| Value | Count | Frequency (%) | |
| ા | 12 | 12.4% | |
| ર | 7 | 7.2% | |
| ી | 7 | 7.2% | |
| ુ | 6 | 6.2% | |
| વ | 6 | 6.2% | |
| ં | 6 | 6.2% | |
| ત | 5 | 5.2% | |
| ે | 5 | 5.2% | |
| મ | 5 | 5.2% | |
| ગ | 4 | 4.1% | |
| જ | 3 | 3.1% | |
| ્ | 3 | 3.1% | |
| બ | 3 | 3.1% | |
| ો | 3 | 3.1% | |
| ક | 3 | 3.1% | |
| છ | 2 | 2.1% | |
| લ | 2 | 2.1% | |
| ચ | 2 | 2.1% | |
| ણ | 2 | 2.1% | |
| હ | 1 | 1.0% | |
| ખ | 1 | 1.0% | |
| થ | 1 | 1.0% | |
| ળ | 1 | 1.0% | |
| િ | 1 | 1.0% | |
| અ | 1 | 1.0% | |
| Other values (5) | 5 | 5.2% |
Most frequent Malayalam characters
| Value | Count | Frequency (%) | |
| ് | 14 | 12.1% | |
| ക | 10 | 8.6% | |
| ന | 10 | 8.6% | |
| ത | 9 | 7.8% | |
| ി | 7 | 6.0% | |
| ാ | 6 | 5.2% | |
| ു | 5 | 4.3% | |
| മ | 4 | 3.4% | |
| ര | 4 | 3.4% | |
| യ | 4 | 3.4% | |
| ശ | 4 | 3.4% | |
| ീ | 3 | 2.6% | |
| െ | 3 | 2.6% | |
| ങ | 3 | 2.6% | |
| ൻ | 3 | 2.6% | |
| പ | 3 | 2.6% | |
| ച | 2 | 1.7% | |
| ഭ | 2 | 1.7% | |
| ൃ | 2 | 1.7% | |
| ൂ | 2 | 1.7% | |
| ർ | 2 | 1.7% | |
| ള | 2 | 1.7% | |
| ൽ | 1 | 0.9% | |
| ൊ | 1 | 0.9% | |
| വ | 1 | 0.9% | |
| Other values (9) | 9 | 7.8% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| ్ | 52 | 8.7% | |
| ు | 47 | 7.9% | |
| ా | 37 | 6.2% | |
| ి | 36 | 6.0% | |
| న | 34 | 5.7% | |
| ల | 31 | 5.2% | |
| త | 31 | 5.2% | |
| క | 28 | 4.7% | |
| మ | 27 | 4.5% | |
| ర | 27 | 4.5% | |
| వ | 20 | 3.4% | |
| ం | 19 | 3.2% | |
| గ | 15 | 2.5% | |
| స | 15 | 2.5% | |
| ప | 15 | 2.5% | |
| ె | 15 | 2.5% | |
| ట | 13 | 2.2% | |
| ో | 13 | 2.2% | |
| య | 12 | 2.0% | |
| ే | 11 | 1.8% | |
| అ | 10 | 1.7% | |
| చ | 10 | 1.7% | |
| ష | 9 | 1.5% | |
| డ | 9 | 1.5% | |
| భ | 8 | 1.3% | |
| Other values (21) | 52 | 8.7% |
Most frequent Tibetan characters
| Value | Count | Frequency (%) | |
| ࿗ | 14 | 77.8% | |
| ༒ | 4 | 22.2% |
Most frequent Tagalog characters
| Value | Count | Frequency (%) | |
| ᜉ | 1 | 25.0% | |
| ᜂ | 1 | 25.0% | |
| ᜎ | 1 | 25.0% | |
| ᜓ | 1 | 25.0% |
Most frequent UCAS characters
| Value | Count | Frequency (%) | |
| ᗩ | 23 | 16.1% | |
| ᑎ | 17 | 11.9% | |
| ᔕ | 16 | 11.2% | |
| ᒪ | 15 | 10.5% | |
| ᖇ | 14 | 9.8% | |
| ᗰ | 10 | 7.0% | |
| ᑕ | 9 | 6.3% | |
| ᗪ | 8 | 5.6% | |
| ᐯ | 6 | 4.2% | |
| ᑭ | 5 | 3.5% | |
| ᑌ | 4 | 2.8% | |
| ᗯ | 2 | 1.4% | |
| ᕼ | 2 | 1.4% | |
| ᗷ | 2 | 1.4% | |
| ᗴ | 2 | 1.4% | |
| ᐃ | 2 | 1.4% | |
| ᒍ | 1 | 0.7% | |
| ᖴ | 1 | 0.7% | |
| ᖃ | 1 | 0.7% | |
| ᓗ | 1 | 0.7% | |
| ᑦ | 1 | 0.7% | |
| ᗡ | 1 | 0.7% |
Most frequent Coptic characters
| Value | Count | Frequency (%) | |
| Ⲁ | 2 | 28.6% | |
| Ⲍ | 2 | 28.6% | |
| Ⲧ | 2 | 28.6% | |
| ⲁ | 1 | 14.3% |
Most frequent Javanese characters
| Value | Count | Frequency (%) | |
| ꧁ | 2 | 50.0% | |
| ꧂ | 2 | 50.0% |
Most frequent Ol Chiki characters
| Value | Count | Frequency (%) | |
| ᱛ | 2 | 100.0% |
Most frequent Cherokee characters
| Value | Count | Frequency (%) | |
| Ꮹ | 2 | 40.0% | |
| Ᏻ | 1 | 20.0% | |
| Ꮃ | 1 | 20.0% | |
| Ꭹ | 1 | 20.0% |
Most frequent Arabic PF A characters
| Value | Count | Frequency (%) | |
| ﷺ | 2 | 40.0% | |
| ﴿ | 1 | 20.0% | |
| ﴾ | 1 | 20.0% | |
| ﷽ | 1 | 20.0% |
Most frequent Latin Ext Additional characters
| Value | Count | Frequency (%) | |
| ệ | 3 | 15.0% | |
| ố | 3 | 15.0% | |
| ộ | 2 | 10.0% | |
| ứ | 2 | 10.0% | |
| ờ | 2 | 10.0% | |
| ị | 2 | 10.0% | |
| ự | 1 | 5.0% | |
| ủ | 1 | 5.0% | |
| ể | 1 | 5.0% | |
| ồ | 1 | 5.0% | |
| ỹ | 1 | 5.0% | |
| ọ | 1 | 5.0% |
Most frequent Sup Punctuation characters
| Value | Count | Frequency (%) | |
| ⸮ | 1 | 100.0% |
Most frequent Misc Math Symbols A characters
| Value | Count | Frequency (%) | |
| ⟭ | 4 | 50.0% | |
| ⟬ | 4 | 50.0% |
Most frequent Armenian characters
| Value | Count | Frequency (%) | |
| ա | 8 | 23.5% | |
| կ | 4 | 11.8% | |
| ե | 3 | 8.8% | |
| տ | 2 | 5.9% | |
| ն | 2 | 5.9% | |
| ր | 2 | 5.9% | |
| Ք | 1 | 2.9% | |
| ո | 1 | 2.9% | |
| ղ | 1 | 2.9% | |
| վ | 1 | 2.9% | |
| զ | 1 | 2.9% | |
| ը | 1 | 2.9% | |
| ֍ | 1 | 2.9% | |
| ֎ | 1 | 2.9% | |
| Փ | 1 | 2.9% | |
| ս | 1 | 2.9% | |
| չ | 1 | 2.9% | |
| հ | 1 | 2.9% | |
| ի | 1 | 2.9% |
Most frequent Misc Math Symbols B characters
| Value | Count | Frequency (%) | |
| ⦁ | 2 | 66.7% | |
| ⧓ | 1 | 33.3% |
Most frequent Runic characters
| Value | Count | Frequency (%) | |
| ᛉ | 1 | 50.0% | |
| ᛟ | 1 | 50.0% |
Most frequent Sup PUA A characters
| Value | Count | Frequency (%) | |
| | 2 | 100.0% |
Most frequent Tifinagh characters
| Value | Count | Frequency (%) | |
| ⵣ | 1 | 100.0% |
Most frequent Ethiopic characters
| Value | Count | Frequency (%) | |
| ል | 1 | 16.7% | |
| ማ | 1 | 16.7% | |
| ት | 1 | 16.7% | |
| ፖ | 1 | 16.7% | |
| ሊ | 1 | 16.7% | |
| ሲ | 1 | 16.7% |
Most frequent Compat Jamo characters
| Value | Count | Frequency (%) | |
| ㅣ | 2 | 100.0% |
Most frequent Lao characters
| Value | Count | Frequency (%) | |
| ໒ | 1 | 50.0% | |
| ຮ | 1 | 50.0% |
Most frequent Yi Radicals characters
| Value | Count | Frequency (%) | |
| ꒱ | 1 | 100.0% |
Most frequent Bamum Sup characters
| Value | Count | Frequency (%) | |
| 𖥻 | 1 | 100.0% |
| Distinct | 25871 |
|---|---|
| Distinct (%) | 56.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 360.0 KiB |
| 2010-09-20 17:01:08 | 1026 |
|---|---|
| 2009-04-22 12:55:28 | 283 |
| 2020-05-21 15:54:09 | 246 |
| 2019-12-31 06:11:12 | 184 |
| 2020-08-11 09:12:38 | 170 |
| Other values (25866) |
| Value | Count | Frequency (%) | |
| 2010-09-20 17:01:08 | 1026 | 2.2% | |
| 2009-04-22 12:55:28 | 283 | 0.6% | |
| 2020-05-21 15:54:09 | 246 | 0.5% | |
| 2019-12-31 06:11:12 | 184 | 0.4% | |
| 2020-08-11 09:12:38 | 170 | 0.4% | |
| 2015-05-22 08:31:12 | 135 | 0.3% | |
| 2019-03-25 17:58:43 | 132 | 0.3% | |
| 2009-03-16 03:03:13 | 126 | 0.3% | |
| 2020-09-17 18:16:07 | 121 | 0.3% | |
| 2012-05-23 02:53:47 | 119 | 0.3% | |
| 2011-05-23 15:00:26 | 110 | 0.2% | |
| 2009-07-09 09:04:01 | 91 | 0.2% | |
| 2015-01-02 14:13:17 | 90 | 0.2% | |
| 2017-02-20 08:41:38 | 88 | 0.2% | |
| 2013-01-24 03:18:59 | 79 | 0.2% | |
| 2009-07-25 08:41:05 | 73 | 0.2% | |
| 2015-01-16 04:52:01 | 68 | 0.1% | |
| 2019-08-22 13:21:22 | 68 | 0.1% | |
| 2018-02-04 12:36:42 | 65 | 0.1% | |
| 2009-08-11 06:12:45 | 57 | 0.1% | |
| 2017-12-29 11:04:46 | 56 | 0.1% | |
| 2011-07-20 00:59:59 | 55 | 0.1% | |
| 2019-10-19 12:24:33 | 55 | 0.1% | |
| 2010-05-08 13:21:45 | 55 | 0.1% | |
| 2013-07-19 11:26:39 | 55 | 0.1% | |
| Other values (25846) | 42452 | 92.2% |
Frequencies of value counts
Unique
| Unique | 20528 ? |
|---|---|
| Unique (%) | 44.6% |
Histogram of lengths of the category
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 170636 | 19.5% | |
| 1 | 127176 | 14.5% | |
| 2 | 120739 | 13.8% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 46059 | 5.3% | ||
| 3 | 43527 | 5.0% | |
| 5 | 39138 | 4.5% | |
| 4 | 38316 | 4.4% | |
| 9 | 32570 | 3.7% | |
| 8 | 25782 | 2.9% | |
| 7 | 23922 | 2.7% | |
| 6 | 23020 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 644826 | 73.7% | |
| Dash Punctuation | 92118 | 10.5% | |
| Other Punctuation | 92118 | 10.5% | |
| Space Separator | 46059 | 5.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 170636 | 26.5% | |
| 1 | 127176 | 19.7% | |
| 2 | 120739 | 18.7% | |
| 3 | 43527 | 6.8% | |
| 5 | 39138 | 6.1% | |
| 4 | 38316 | 5.9% | |
| 9 | 32570 | 5.1% | |
| 8 | 25782 | 4.0% | |
| 7 | 23922 | 3.7% | |
| 6 | 23020 | 3.6% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 92118 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 46059 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| : | 92118 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 875121 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 170636 | 19.5% | |
| 1 | 127176 | 14.5% | |
| 2 | 120739 | 13.8% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 46059 | 5.3% | ||
| 3 | 43527 | 5.0% | |
| 5 | 39138 | 4.5% | |
| 4 | 38316 | 4.4% | |
| 9 | 32570 | 3.7% | |
| 8 | 25782 | 2.9% | |
| 7 | 23922 | 2.7% | |
| 6 | 23020 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 875121 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 170636 | 19.5% | |
| 1 | 127176 | 14.5% | |
| 2 | 120739 | 13.8% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 46059 | 5.3% | ||
| 3 | 43527 | 5.0% | |
| 5 | 39138 | 4.5% | |
| 4 | 38316 | 4.4% | |
| 9 | 32570 | 3.7% | |
| 8 | 25782 | 2.9% | |
| 7 | 23922 | 2.7% | |
| 6 | 23020 | 2.6% |
user_followers
Real number (ℝ≥0)
| Distinct | 9752 |
|---|---|
| Distinct (%) | 21.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103497.3731 |
|---|---|
| Minimum | 0 |
| Maximum | 14919786 |
| Zeros | 335 |
| Zeros (%) | 0.7% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 123 |
| median | 584 |
| Q3 | 2705.5 |
| 95-th percentile | 95576.7 |
| Maximum | 14919786 |
| Range | 14919786 |
| Interquartile range (IQR) | 2582.5 |
Descriptive statistics
| Standard deviation | 854781.0032 |
|---|---|
| Coefficient of variation (CV) | 8.258963275 |
| Kurtosis | 173.9796234 |
| Mean | 103497.3731 |
| Median Absolute Deviation (MAD) | 556 |
| Skewness | 12.42369974 |
| Sum | 4766985506 |
| Variance | 7.306505634e+11 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 335 | 0.7% | |
| 1 | 303 | 0.7% | |
| 3 | 238 | 0.5% | |
| 1190 | 232 | 0.5% | |
| 4 | 218 | 0.5% | |
| 10 | 218 | 0.5% | |
| 2 | 210 | 0.5% | |
| 7 | 210 | 0.5% | |
| 6 | 193 | 0.4% | |
| 16 | 188 | 0.4% | |
| 12 | 188 | 0.4% | |
| 5 | 166 | 0.4% | |
| 8 | 163 | 0.4% | |
| 13 | 158 | 0.3% | |
| 1179 | 155 | 0.3% | |
| 14 | 143 | 0.3% | |
| 9 | 142 | 0.3% | |
| 1177 | 142 | 0.3% | |
| 15 | 128 | 0.3% | |
| 11 | 126 | 0.3% | |
| 17 | 125 | 0.3% | |
| 26 | 116 | 0.3% | |
| 50 | 114 | 0.2% | |
| 27 | 112 | 0.2% | |
| 22 | 112 | 0.2% | |
| Other values (9727) | 41624 | 90.4% |
| Value | Count | Frequency (%) | |
| 0 | 335 | 0.7% | |
| 1 | 303 | 0.7% | |
| 2 | 210 | 0.5% | |
| 3 | 238 | 0.5% | |
| 4 | 218 | 0.5% | |
| 5 | 166 | 0.4% | |
| 6 | 193 | 0.4% | |
| 7 | 210 | 0.5% | |
| 8 | 163 | 0.4% | |
| 9 | 142 | 0.3% |
| Value | Count | Frequency (%) | |
| 14919786 | 2 | < 0.1% | |
| 14879495 | 1 | < 0.1% | |
| 14879493 | 1 | < 0.1% | |
| 14873025 | 2 | < 0.1% | |
| 14859597 | 1 | < 0.1% | |
| 14856742 | 3 | < 0.1% | |
| 14856740 | 3 | < 0.1% | |
| 14838357 | 1 | < 0.1% | |
| 14824139 | 1 | < 0.1% | |
| 14811850 | 1 | < 0.1% |
| Distinct | 5199 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1334.568011 |
|---|---|
| Minimum | 0 |
| Maximum | 380428 |
| Zeros | 397 |
| Zeros (%) | 0.9% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 148 |
| median | 425 |
| Q3 | 1222 |
| 95-th percentile | 4870.1 |
| Maximum | 380428 |
| Range | 380428 |
| Interquartile range (IQR) | 1074 |
Descriptive statistics
| Standard deviation | 5998.529071 |
|---|---|
| Coefficient of variation (CV) | 4.494734644 |
| Kurtosis | 2033.415311 |
| Mean | 1334.568011 |
| Median Absolute Deviation (MAD) | 352 |
| Skewness | 37.72401569 |
| Sum | 61468868 |
| Variance | 35982351.02 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 397 | 0.9% | |
| 1 | 307 | 0.7% | |
| 306 | 263 | 0.6% | |
| 206 | 259 | 0.6% | |
| 6 | 253 | 0.5% | |
| 196 | 224 | 0.5% | |
| 3 | 217 | 0.5% | |
| 197 | 179 | 0.4% | |
| 2 | 177 | 0.4% | |
| 141 | 155 | 0.3% | |
| 10 | 150 | 0.3% | |
| 142 | 147 | 0.3% | |
| 45 | 146 | 0.3% | |
| 7 | 145 | 0.3% | |
| 25 | 143 | 0.3% | |
| 70 | 137 | 0.3% | |
| 144 | 123 | 0.3% | |
| 22 | 120 | 0.3% | |
| 28 | 120 | 0.3% | |
| 5001 | 113 | 0.2% | |
| 38 | 111 | 0.2% | |
| 21 | 107 | 0.2% | |
| 17 | 106 | 0.2% | |
| 26 | 105 | 0.2% | |
| 107 | 105 | 0.2% | |
| Other values (5174) | 41750 | 90.6% |
| Value | Count | Frequency (%) | |
| 0 | 397 | 0.9% | |
| 1 | 307 | 0.7% | |
| 2 | 177 | 0.4% | |
| 3 | 217 | 0.5% | |
| 4 | 86 | 0.2% | |
| 5 | 93 | 0.2% | |
| 6 | 253 | 0.5% | |
| 7 | 145 | 0.3% | |
| 8 | 99 | 0.2% | |
| 9 | 97 | 0.2% |
| Value | Count | Frequency (%) | |
| 380428 | 1 | < 0.1% | |
| 380362 | 2 | < 0.1% | |
| 380353 | 1 | < 0.1% | |
| 380265 | 1 | < 0.1% | |
| 274718 | 1 | < 0.1% | |
| 273812 | 1 | < 0.1% | |
| 195289 | 1 | < 0.1% | |
| 149813 | 1 | < 0.1% | |
| 149723 | 1 | < 0.1% | |
| 149699 | 1 | < 0.1% |
| Distinct | 16866 |
|---|---|
| Distinct (%) | 36.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15462.41948 |
|---|---|
| Minimum | 0 |
| Maximum | 1205878 |
| Zeros | 671 |
| Zeros (%) | 1.5% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 379 |
| median | 2225 |
| Q3 | 11555.5 |
| 95-th percentile | 71728 |
| Maximum | 1205878 |
| Range | 1205878 |
| Interquartile range (IQR) | 11176.5 |
Descriptive statistics
| Standard deviation | 42933.05137 |
|---|---|
| Coefficient of variation (CV) | 2.776606301 |
| Kurtosis | 109.6825323 |
| Mean | 15462.41948 |
| Median Absolute Deviation (MAD) | 2185 |
| Skewness | 8.244921903 |
| Sum | 712183579 |
| Variance | 1843246900 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 671 | 1.5% | |
| 3 | 288 | 0.6% | |
| 1 | 237 | 0.5% | |
| 1837 | 229 | 0.5% | |
| 24 | 208 | 0.5% | |
| 25 | 177 | 0.4% | |
| 535 | 161 | 0.3% | |
| 2 | 155 | 0.3% | |
| 1692 | 151 | 0.3% | |
| 4 | 148 | 0.3% | |
| 5 | 143 | 0.3% | |
| 14 | 113 | 0.2% | |
| 1063 | 113 | 0.2% | |
| 7 | 112 | 0.2% | |
| 10 | 111 | 0.2% | |
| 34 | 109 | 0.2% | |
| 6 | 108 | 0.2% | |
| 15 | 103 | 0.2% | |
| 13 | 96 | 0.2% | |
| 33 | 94 | 0.2% | |
| 19 | 90 | 0.2% | |
| 17 | 88 | 0.2% | |
| 11 | 87 | 0.2% | |
| 502 | 81 | 0.2% | |
| 32 | 81 | 0.2% | |
| Other values (16841) | 42105 | 91.4% |
| Value | Count | Frequency (%) | |
| 0 | 671 | 1.5% | |
| 1 | 237 | 0.5% | |
| 2 | 155 | 0.3% | |
| 3 | 288 | 0.6% | |
| 4 | 148 | 0.3% | |
| 5 | 143 | 0.3% | |
| 6 | 108 | 0.2% | |
| 7 | 112 | 0.2% | |
| 8 | 75 | 0.2% | |
| 9 | 77 | 0.2% |
| Value | Count | Frequency (%) | |
| 1205878 | 1 | < 0.1% | |
| 948246 | 1 | < 0.1% | |
| 947901 | 1 | < 0.1% | |
| 946118 | 2 | < 0.1% | |
| 924667 | 1 | < 0.1% | |
| 886935 | 1 | < 0.1% | |
| 870993 | 1 | < 0.1% | |
| 850337 | 1 | < 0.1% | |
| 777462 | 1 | < 0.1% | |
| 773740 | 1 | < 0.1% |
user_verified
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 40999 | 89.0% | |
| True | 5060 | 11.0% |
| Distinct | 45622 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 360.0 KiB |
| 2021-03-02 23:02:10 | 4 |
|---|---|
| 2021-02-09 07:30:00 | 3 |
| 2021-03-30 01:30:00 | 3 |
| 2021-02-24 08:30:00 | 3 |
| 2021-02-15 00:00:56 | 3 |
| Other values (45617) |
| Value | Count | Frequency (%) | |
| 2021-03-02 23:02:10 | 4 | < 0.1% | |
| 2021-02-09 07:30:00 | 3 | < 0.1% | |
| 2021-03-30 01:30:00 | 3 | < 0.1% | |
| 2021-02-24 08:30:00 | 3 | < 0.1% | |
| 2021-02-15 00:00:56 | 3 | < 0.1% | |
| 2021-03-02 17:50:24 | 3 | < 0.1% | |
| 2021-03-01 04:52:10 | 3 | < 0.1% | |
| 2021-02-13 00:30:00 | 3 | < 0.1% | |
| 2021-03-01 06:37:02 | 3 | < 0.1% | |
| 2021-03-02 23:02:08 | 3 | < 0.1% | |
| 2021-03-02 05:30:00 | 3 | < 0.1% | |
| 2021-03-01 03:09:31 | 3 | < 0.1% | |
| 2021-03-31 21:25:03 | 2 | < 0.1% | |
| 2021-03-26 09:52:48 | 2 | < 0.1% | |
| 2021-03-29 23:00:01 | 2 | < 0.1% | |
| 2021-03-10 06:50:21 | 2 | < 0.1% | |
| 2021-03-04 09:12:56 | 2 | < 0.1% | |
| 2021-02-28 10:57:00 | 2 | < 0.1% | |
| 2021-03-01 04:26:59 | 2 | < 0.1% | |
| 2021-03-16 08:25:40 | 2 | < 0.1% | |
| 2021-02-07 10:02:38 | 2 | < 0.1% | |
| 2021-02-09 08:41:26 | 2 | < 0.1% | |
| 2021-03-09 13:15:00 | 2 | < 0.1% | |
| 2021-04-01 14:30:00 | 2 | < 0.1% | |
| 2021-03-31 06:16:59 | 2 | < 0.1% | |
| Other values (45597) | 45996 | 99.9% |
Frequencies of value counts
Unique
| Unique | 45198 ? |
|---|---|
| Unique (%) | 98.1% |
Histogram of lengths of the category
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 161238 | 18.4% | |
| 2 | 158818 | 18.1% | |
| 1 | 125061 | 14.3% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 3 | 64035 | 7.3% | |
| 46059 | 5.3% | ||
| 4 | 35027 | 4.0% | |
| 5 | 32372 | 3.7% | |
| 6 | 17530 | 2.0% | |
| 9 | 17077 | 2.0% | |
| 8 | 16981 | 1.9% | |
| 7 | 16687 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 644826 | 73.7% | |
| Dash Punctuation | 92118 | 10.5% | |
| Other Punctuation | 92118 | 10.5% | |
| Space Separator | 46059 | 5.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 161238 | 25.0% | |
| 2 | 158818 | 24.6% | |
| 1 | 125061 | 19.4% | |
| 3 | 64035 | 9.9% | |
| 4 | 35027 | 5.4% | |
| 5 | 32372 | 5.0% | |
| 6 | 17530 | 2.7% | |
| 9 | 17077 | 2.6% | |
| 8 | 16981 | 2.6% | |
| 7 | 16687 | 2.6% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 92118 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 46059 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| : | 92118 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 875121 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 161238 | 18.4% | |
| 2 | 158818 | 18.1% | |
| 1 | 125061 | 14.3% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 3 | 64035 | 7.3% | |
| 46059 | 5.3% | ||
| 4 | 35027 | 4.0% | |
| 5 | 32372 | 3.7% | |
| 6 | 17530 | 2.0% | |
| 9 | 17077 | 2.0% | |
| 8 | 16981 | 1.9% | |
| 7 | 16687 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 875121 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 161238 | 18.4% | |
| 2 | 158818 | 18.1% | |
| 1 | 125061 | 14.3% | |
| - | 92118 | 10.5% | |
| : | 92118 | 10.5% | |
| 3 | 64035 | 7.3% | |
| 46059 | 5.3% | ||
| 4 | 35027 | 4.0% | |
| 5 | 32372 | 3.7% | |
| 6 | 17530 | 2.0% | |
| 9 | 17077 | 2.0% | |
| 8 | 16981 | 1.9% | |
| 7 | 16687 | 1.9% |
| Distinct | 46018 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 360.0 KiB |
| @POTUS What about #Covaxin from #Ocugen ?! It seems like it's better than anything we have now! Is it coming to the US? Why or why not?? | 5 |
|---|---|
| #Covid19 Vaccine Rollout Needs Spark Even More Innovation https://t.co/EBaYqJexm1 #Ergotron #VaccinationCart #Pfizer #PfizerBioNTech | 5 |
| @Reuters Do you know the meaning of “V” in the name of the first Russian vaccine- #SputnikV? “V” for Victory. Victory over the #pandemic. | 5 |
| @sputnikvaccine Not even a majority of all Russians are willing to take this vaccin! #SputnikV @sputnikvaccine. | 3 |
| AIIMS at which PM Modi took #Covaxin was built by Nehru. | 3 |
| Other values (46013) |
| Value | Count | Frequency (%) | |
| @POTUS What about #Covaxin from #Ocugen ?! It seems like it's better than anything we have now! Is it coming to the US? Why or why not?? | 5 | < 0.1% | |
| #Covid19 Vaccine Rollout Needs Spark Even More Innovation https://t.co/EBaYqJexm1 #Ergotron #VaccinationCart #Pfizer #PfizerBioNTech | 5 | < 0.1% | |
| @Reuters Do you know the meaning of “V” in the name of the first Russian vaccine- #SputnikV? “V” for Victory. Victory over the #pandemic. | 5 | < 0.1% | |
| @sputnikvaccine Not even a majority of all Russians are willing to take this vaccin! #SputnikV @sputnikvaccine. | 3 | < 0.1% | |
| AIIMS at which PM Modi took #Covaxin was built by Nehru. | 3 | < 0.1% | |
| #Moderna Post jobs for free on https://t.co/Jxbtzryhtg | 2 | < 0.1% | |
| भिखारी Pakistan to receive ‘Made in India’ COVID-19 vaccines from GAVI #COVID19Vaccine #Covaxin #CoronaVirusUpdates | 2 | < 0.1% | |
| @visshnumittal Is there a quality difference in #Covaxin & #CovishieldVaccine ??? | 2 | < 0.1% | |
| So yesterday was rough - fatigue headache and chills. #PfizerBioNTech Better today so far. | 2 | < 0.1% | |
| @WHO @DrTedros Dr faucii = Also know as “the NEW angel of death” #modernA version of #JosefMengele | 2 | < 0.1% | |
| Russia in talks with several Austrian companies on #SputnikV production, RDIF CEO says @sputnikvaccine https://t.co/lp4X9OgURa | 2 | < 0.1% | |
| DOH: Some providers gave out 2nd doses of #Moderna vaccine as 1st doses in mishap https://t.co/9ohs4noIPN | 2 | < 0.1% | |
| @sputnikvaccine @sputnikvaccine #SputnikV doesn't works, it's just a fraud 👇 https://t.co/FuXlfHHxqa | 2 | < 0.1% | |
| Vaccinated and ready to dominate the world once again!😊💯💯 https://t.co/9kFWzG4fIE #dubai #dha @DHA_Dubai #vaccine #PfizerBioNTech | 2 | < 0.1% | |
| PMO India says PM took the first dose of Bharat Biotech's #Covaxin Bharat Biotech | 2 | < 0.1% | |
| 24 hours after the 2nd #Moderna shot and, honestly, I feel like runny baby poo. Definitely on auto-pilot today & tomorrow. | 2 | < 0.1% | |
| 'Dr Reddy's expects #SputnikV vaccine to get approval from Indian regulator in next few weeks' https://t.co/sXHXveqOz9 | 2 | < 0.1% | |
| @WHO @DrTedros 500 deaths today in Italy. Shame to everyone! #SputnikV could save them. | 2 | < 0.1% | |
| test #covaxin | 2 | < 0.1% | |
| Afghanistan and Russia to discuss #SputnikV vaccine supplies soon, foreign minister Says @sputnikvaccine https://t.co/tfCtOaFC7M | 2 | < 0.1% | |
| 1st dose of #vaccine just now #OxfordAstraZeneca Woohoo 😃 | 2 | < 0.1% | |
| Russia, Turkey in talks on joint production of #SputnikV #COVID19Vaccine, ambassador says @sputnikvaccine https://t.co/adLhfJS5es | 2 | < 0.1% | |
| Vaccinated! #Moderna | 2 | < 0.1% | |
| @idnani_nandini #SastaBhiKargarBhi #Covaxin of Bharat Biotech | 2 | < 0.1% | |
| PHARMACY and Poisons Board confirms approval for emergency use of Russia’s #SputnikV COVID vaccine in Kenya after tests. | 2 | < 0.1% | |
| Other values (45993) | 45998 | 99.9% |
Frequencies of value counts
Unique
| Unique | 45988 ? |
|---|---|
| Unique (%) | 99.8% |
Histogram of lengths of the category
Length
| Max length | 156 |
|---|---|
| Median length | 139 |
| Mean length | 126.1271413 |
| Min length | 13 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 721198 | 12.4% | ||
| e | 390068 | 6.7% | |
| t | 386085 | 6.6% | |
| a | 323525 | 5.6% | |
| o | 323158 | 5.6% | |
| i | 292141 | 5.0% | |
| n | 274557 | 4.7% | |
| s | 251218 | 4.3% | |
| c | 209618 | 3.6% | |
| r | 208095 | 3.6% | |
| h | 181481 | 3.1% | |
| d | 145507 | 2.5% | |
| / | 128374 | 2.2% | |
| l | 112396 | 1.9% | |
| p | 111586 | 1.9% | |
| u | 90895 | 1.6% | |
| # | 86389 | 1.5% | |
| f | 83761 | 1.4% | |
| v | 83527 | 1.4% | |
| m | 77430 | 1.3% | |
| . | 73324 | 1.3% | |
| y | 68142 | 1.2% | |
| g | 63501 | 1.1% | |
| C | 47192 | 0.8% | |
| w | 47051 | 0.8% | |
| Other values (1375) | 1029071 | 17.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3867595 | 66.6% | |
| Space Separator | 721428 | 12.4% | |
| Uppercase Letter | 567569 | 9.8% | |
| Other Punctuation | 440883 | 7.6% | |
| Decimal Number | 139070 | 2.4% | |
| Control | 28218 | 0.5% | |
| Other Symbol | 13041 | 0.2% | |
| Dash Punctuation | 9703 | 0.2% | |
| Final Punctuation | 6034 | 0.1% | |
| Connector Punctuation | 3723 | 0.1% | |
| Open Punctuation | 2043 | < 0.1% | |
| Close Punctuation | 1834 | < 0.1% | |
| Other Letter | 1519 | < 0.1% | |
| Nonspacing Mark | 1366 | < 0.1% | |
| Currency Symbol | 1229 | < 0.1% | |
| Math Symbol | 1159 | < 0.1% | |
| Initial Punctuation | 1026 | < 0.1% | |
| Modifier Symbol | 887 | < 0.1% | |
| Format | 545 | < 0.1% | |
| Spacing Mark | 149 | < 0.1% | |
| Modifier Letter | 132 | < 0.1% | |
| Enclosing Mark | 128 | < 0.1% | |
| Other Number | 9 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 47192 | 8.3% | |
| I | 42864 | 7.6% | |
| V | 35880 | 6.3% | |
| S | 33966 | 6.0% | |
| M | 31192 | 5.5% | |
| T | 30039 | 5.3% | |
| A | 30008 | 5.3% | |
| O | 29948 | 5.3% | |
| D | 28022 | 4.9% | |
| P | 26760 | 4.7% | |
| N | 24720 | 4.4% | |
| B | 23443 | 4.1% | |
| E | 18467 | 3.3% | |
| R | 18002 | 3.2% | |
| H | 16688 | 2.9% | |
| W | 15306 | 2.7% | |
| F | 14920 | 2.6% | |
| G | 14809 | 2.6% | |
| U | 13965 | 2.5% | |
| L | 12409 | 2.2% | |
| K | 11337 | 2.0% | |
| J | 11125 | 2.0% | |
| Z | 10909 | 1.9% | |
| Y | 9733 | 1.7% | |
| X | 8341 | 1.5% | |
| Other values (79) | 7524 | 1.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 390068 | 10.1% | |
| t | 386085 | 10.0% | |
| a | 323525 | 8.4% | |
| o | 323158 | 8.4% | |
| i | 292141 | 7.6% | |
| n | 274557 | 7.1% | |
| s | 251218 | 6.5% | |
| c | 209618 | 5.4% | |
| r | 208095 | 5.4% | |
| h | 181481 | 4.7% | |
| d | 145507 | 3.8% | |
| l | 112396 | 2.9% | |
| p | 111586 | 2.9% | |
| u | 90895 | 2.4% | |
| f | 83761 | 2.2% | |
| v | 83527 | 2.2% | |
| m | 77430 | 2.0% | |
| y | 68142 | 1.8% | |
| g | 63501 | 1.6% | |
| w | 47051 | 1.2% | |
| b | 43109 | 1.1% | |
| k | 37535 | 1.0% | |
| x | 20499 | 0.5% | |
| z | 19957 | 0.5% | |
| j | 13075 | 0.3% | |
| Other values (182) | 9678 | 0.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 721198 | > 99.9% | ||
| 215 | < 0.1% | ||
| 7 | < 0.1% | ||
| 4 | < 0.1% | ||
| 4 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 128374 | 29.1% | |
| # | 86389 | 19.6% | |
| . | 73324 | 16.6% | |
| : | 46877 | 10.6% | |
| … | 32434 | 7.4% | |
| @ | 23281 | 5.3% | |
| , | 19702 | 4.5% | |
| ! | 9054 | 2.1% | |
| ' | 7828 | 1.8% | |
| ? | 3986 | 0.9% | |
| ; | 2983 | 0.7% | |
| & | 2612 | 0.6% | |
| " | 2011 | 0.5% | |
| % | 1437 | 0.3% | |
| * | 350 | 0.1% | |
| ‼ | 133 | < 0.1% | |
| • | 75 | < 0.1% | |
| · | 6 | < 0.1% | |
| ⁉ | 5 | < 0.1% | |
| ¡ | 5 | < 0.1% | |
| ・ | 4 | < 0.1% | |
| । | 3 | < 0.1% | |
| ¿ | 1 | < 0.1% | |
| 〽 | 1 | < 0.1% | |
| \ | 1 | < 0.1% | |
| Other values (7) | 7 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 27774 | 20.0% | |
| 9 | 20254 | 14.6% | |
| 0 | 17061 | 12.3% | |
| 2 | 16267 | 11.7% | |
| 3 | 10674 | 7.7% | |
| 5 | 10044 | 7.2% | |
| 4 | 9906 | 7.1% | |
| 8 | 9275 | 6.7% | |
| 6 | 9030 | 6.5% | |
| 7 | 8778 | 6.3% | |
| 𝟯 | 1 | < 0.1% | |
| 𝟱 | 1 | < 0.1% | |
| 𝟭 | 1 | < 0.1% | |
| 𝟵 | 1 | < 0.1% | |
| 𝟏 | 1 | < 0.1% | |
| 𝟗 | 1 | < 0.1% | |
| 𝟷 | 1 | < 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 3723 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 9360 | 96.5% | |
| — | 219 | 2.3% | |
| – | 122 | 1.3% | |
| ‑ | 2 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1935 | 94.7% | |
| [ | 97 | 4.7% | |
| „ | 6 | 0.3% | |
| { | 3 | 0.1% | |
| 「 | 1 | < 0.1% | |
| ( | 1 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1724 | 94.0% | |
| ] | 88 | 4.8% | |
| 》 | 19 | 1.0% | |
| } | 2 | 0.1% | |
| ) | 1 | 0.1% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 28216 | > 99.9% | ||
| 2 | < 0.1% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| | 163 | 29.9% | |
| | 143 | 26.2% | |
| | 131 | 24.0% | |
| | 87 | 16.0% | |
| | 7 | 1.3% | |
| | 2 | 0.4% | |
| | 2 | 0.4% | |
| | 2 | 0.4% | |
| | 2 | 0.4% | |
| | 2 | 0.4% | |
| | 2 | 0.4% | |
| | 1 | 0.2% | |
| | 1 | 0.2% |
Most frequent Initial Punctuation characters
| Value | Count | Frequency (%) | |
| “ | 720 | 70.2% | |
| ‘ | 295 | 28.8% | |
| « | 11 | 1.1% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| 💉 | 1727 | 13.2% | |
| 🙏 | 438 | 3.4% | |
| 🇳 | 390 | 3.0% | |
| 🇺 | 319 | 2.4% | |
| 😂 | 319 | 2.4% | |
| ✅ | 305 | 2.3% | |
| 👏 | 299 | 2.3% | |
| 💪 | 299 | 2.3% | |
| 👍 | 276 | 2.1% | |
| 🇨 | 265 | 2.0% | |
| 👇 | 252 | 1.9% | |
| ❤ | 235 | 1.8% | |
| 🇮 | 210 | 1.6% | |
| 🇷 | 178 | 1.4% | |
| 🤣 | 176 | 1.3% | |
| 🙌 | 169 | 1.3% | |
| 😷 | 168 | 1.3% | |
| 🦠 | 167 | 1.3% | |
| 🇸 | 157 | 1.2% | |
| 🚀 | 155 | 1.2% | |
| 🇪 | 148 | 1.1% | |
| 🇦 | 147 | 1.1% | |
| 🤔 | 143 | 1.1% | |
| 😊 | 138 | 1.1% | |
| 🇬 | 132 | 1.0% | |
| Other values (537) | 5829 | 44.7% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 5508 | 91.3% | |
| ” | 514 | 8.5% | |
| » | 10 | 0.2% | |
| › | 2 | < 0.1% |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| ️ | 1137 | 83.2% | |
| े | 37 | 2.7% | |
| ् | 29 | 2.1% | |
| ं | 25 | 1.8% | |
| ் | 14 | 1.0% | |
| ี | 10 | 0.7% | |
| ิ | 10 | 0.7% | |
| ั | 9 | 0.7% | |
| ่ | 8 | 0.6% | |
| ် | 8 | 0.6% | |
| ̇ | 7 | 0.5% | |
| ̶ | 7 | 0.5% | |
| ု | 6 | 0.4% | |
| ိ | 6 | 0.4% | |
| ් | 4 | 0.3% | |
| ͟ | 4 | 0.3% | |
| ้ | 4 | 0.3% | |
| ู | 3 | 0.2% | |
| ా | 3 | 0.2% | |
| ံ | 2 | 0.1% | |
| ွ | 2 | 0.1% | |
| ီ | 2 | 0.1% | |
| ू | 2 | 0.1% | |
| ै | 2 | 0.1% | |
| ็ | 2 | 0.1% | |
| Other values (17) | 23 | 1.7% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| 🏻 | 317 | 35.7% | |
| 🏼 | 243 | 27.4% | |
| 🏽 | 155 | 17.5% | |
| 🏾 | 115 | 13.0% | |
| ` | 20 | 2.3% | |
| 🏿 | 14 | 1.6% | |
| ^ | 13 | 1.5% | |
| ´ | 10 | 1.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| | | 628 | 54.2% | |
| + | 326 | 28.1% | |
| = | 96 | 8.3% | |
| ~ | 85 | 7.3% | |
| ⤵ | 7 | 0.6% | |
| → | 6 | 0.5% | |
| × | 3 | 0.3% | |
| ± | 2 | 0.2% | |
| ⤴ | 2 | 0.2% | |
| ≥ | 2 | 0.2% | |
| | | 1 | 0.1% | |
| ≈ | 1 | 0.1% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 1171 | 95.3% | |
| € | 21 | 1.7% | |
| £ | 21 | 1.7% | |
| ₹ | 16 | 1.3% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| ا | 121 | 8.0% | |
| र | 83 | 5.5% | |
| ر | 70 | 4.6% | |
| ل | 54 | 3.6% | |
| द | 45 | 3.0% | |
| و | 40 | 2.6% | |
| ن | 38 | 2.5% | |
| ز | 36 | 2.4% | |
| न | 34 | 2.2% | |
| ج | 34 | 2.2% | |
| ی | 25 | 1.6% | |
| म | 24 | 1.6% | |
| ي | 23 | 1.5% | |
| م | 21 | 1.4% | |
| ह | 18 | 1.2% | |
| ว | 17 | 1.1% | |
| ک | 16 | 1.1% | |
| ئ | 15 | 1.0% | |
| ค | 15 | 1.0% | |
| ب | 14 | 0.9% | |
| د | 14 | 0.9% | |
| क | 14 | 0.9% | |
| ब | 14 | 0.9% | |
| น | 13 | 0.9% | |
| ज | 12 | 0.8% | |
| Other values (312) | 709 | 46.7% |
Most frequent Enclosing Mark characters
| Value | Count | Frequency (%) | |
| ⃣ | 128 | 100.0% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ी | 32 | 21.5% | |
| ा | 30 | 20.1% | |
| ि | 24 | 16.1% | |
| ो | 24 | 16.1% | |
| ு | 8 | 5.4% | |
| ை | 4 | 2.7% | |
| ာ | 4 | 2.7% | |
| ෙ | 3 | 2.0% | |
| ி | 3 | 2.0% | |
| ா | 2 | 1.3% | |
| ெ | 2 | 1.3% | |
| ே | 2 | 1.3% | |
| ා | 2 | 1.3% | |
| ေ | 2 | 1.3% | |
| း | 2 | 1.3% | |
| ෑ | 1 | 0.7% | |
| ோ | 1 | 0.7% | |
| ျ | 1 | 0.7% | |
| ు | 1 | 0.7% | |
| ೀ | 1 | 0.7% |
Most frequent Modifier Letter characters
| Value | Count | Frequency (%) | |
| ー | 124 | 93.9% | |
| ˈ | 7 | 5.3% | |
| ˌ | 1 | 0.8% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ❶ | 2 | 22.2% | |
| ⅓ | 1 | 11.1% | |
| ➌ | 1 | 11.1% | |
| ➎ | 1 | 11.1% | |
| ⓿ | 1 | 11.1% | |
| ❷ | 1 | 11.1% | |
| ² | 1 | 11.1% | |
| ½ | 1 | 11.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4434130 | 76.3% | |
| Common | 1371666 | 23.6% | |
| Inherited | 1429 | < 0.1% | |
| Arabic | 639 | < 0.1% | |
| Devanagari | 549 | < 0.1% | |
| Thai | 176 | < 0.1% | |
| Han | 121 | < 0.1% | |
| Cyrillic | 110 | < 0.1% | |
| Hangul | 98 | < 0.1% | |
| Tamil | 88 | < 0.1% | |
| Greek | 79 | < 0.1% | |
| Myanmar | 67 | < 0.1% | |
| Katakana | 46 | < 0.1% | |
| Sinhala | 25 | < 0.1% | |
| Telugu | 23 | < 0.1% | |
| Hiragana | 20 | < 0.1% | |
| Kannada | 18 | < 0.1% | |
| Braille | 5 | < 0.1% | |
| Canadian_Aboriginal | 1 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 390068 | 8.8% | |
| t | 386085 | 8.7% | |
| a | 323525 | 7.3% | |
| o | 323158 | 7.3% | |
| i | 292141 | 6.6% | |
| n | 274557 | 6.2% | |
| s | 251218 | 5.7% | |
| c | 209618 | 4.7% | |
| r | 208095 | 4.7% | |
| h | 181481 | 4.1% | |
| d | 145507 | 3.3% | |
| l | 112396 | 2.5% | |
| p | 111586 | 2.5% | |
| u | 90895 | 2.0% | |
| f | 83761 | 1.9% | |
| v | 83527 | 1.9% | |
| m | 77430 | 1.7% | |
| y | 68142 | 1.5% | |
| g | 63501 | 1.4% | |
| C | 47192 | 1.1% | |
| w | 47051 | 1.1% | |
| b | 43109 | 1.0% | |
| I | 42864 | 1.0% | |
| k | 37535 | 0.8% | |
| V | 35880 | 0.8% | |
| Other values (80) | 503808 | 11.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 721198 | 52.6% | ||
| / | 128374 | 9.4% | |
| # | 86389 | 6.3% | |
| . | 73324 | 5.3% | |
| : | 46877 | 3.4% | |
| … | 32434 | 2.4% | |
| 28216 | 2.1% | ||
| 1 | 27774 | 2.0% | |
| @ | 23281 | 1.7% | |
| 9 | 20254 | 1.5% | |
| , | 19702 | 1.4% | |
| 0 | 17061 | 1.2% | |
| 2 | 16267 | 1.2% | |
| 3 | 10674 | 0.8% | |
| 5 | 10044 | 0.7% | |
| 4 | 9906 | 0.7% | |
| - | 9360 | 0.7% | |
| 8 | 9275 | 0.7% | |
| ! | 9054 | 0.7% | |
| 6 | 9030 | 0.7% | |
| 7 | 8778 | 0.6% | |
| ' | 7828 | 0.6% | |
| ’ | 5508 | 0.4% | |
| ? | 3986 | 0.3% | |
| _ | 3723 | 0.3% | |
| Other values (811) | 33349 | 2.4% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ️ | 1137 | 79.6% | |
| | 143 | 10.0% | |
| ⃣ | 128 | 9.0% | |
| ̇ | 7 | 0.5% | |
| ̶ | 7 | 0.5% | |
| ͟ | 4 | 0.3% | |
| ُ | 1 | 0.1% | |
| | 1 | 0.1% | |
| ︎ | 1 | 0.1% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 121 | 18.9% | |
| ر | 70 | 11.0% | |
| ل | 54 | 8.5% | |
| و | 40 | 6.3% | |
| ن | 38 | 5.9% | |
| ز | 36 | 5.6% | |
| ج | 34 | 5.3% | |
| ی | 25 | 3.9% | |
| ي | 23 | 3.6% | |
| م | 21 | 3.3% | |
| ک | 16 | 2.5% | |
| ئ | 15 | 2.3% | |
| ب | 14 | 2.2% | |
| د | 14 | 2.2% | |
| ت | 10 | 1.6% | |
| ع | 9 | 1.4% | |
| خ | 8 | 1.3% | |
| ہ | 8 | 1.3% | |
| س | 7 | 1.1% | |
| ك | 7 | 1.1% | |
| ط | 7 | 1.1% | |
| ة | 6 | 0.9% | |
| ے | 6 | 0.9% | |
| ح | 5 | 0.8% | |
| ص | 5 | 0.8% | |
| Other values (15) | 40 | 6.3% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| र | 83 | 15.1% | |
| द | 45 | 8.2% | |
| े | 37 | 6.7% | |
| न | 34 | 6.2% | |
| ी | 32 | 5.8% | |
| ा | 30 | 5.5% | |
| ् | 29 | 5.3% | |
| ं | 25 | 4.6% | |
| ि | 24 | 4.4% | |
| म | 24 | 4.4% | |
| ो | 24 | 4.4% | |
| ह | 18 | 3.3% | |
| क | 14 | 2.6% | |
| ब | 14 | 2.6% | |
| ज | 12 | 2.2% | |
| त | 12 | 2.2% | |
| स | 10 | 1.8% | |
| ग | 9 | 1.6% | |
| व | 8 | 1.5% | |
| ए | 8 | 1.5% | |
| ल | 8 | 1.5% | |
| प | 6 | 1.1% | |
| औ | 6 | 1.1% | |
| भ | 5 | 0.9% | |
| य | 4 | 0.7% | |
| Other values (15) | 28 | 5.1% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 5 | 100.0% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 疫 | 4 | 3.3% | |
| 科 | 4 | 3.3% | |
| 興 | 4 | 3.3% | |
| 彩 | 4 | 3.3% | |
| 来 | 3 | 2.5% | |
| 自 | 3 | 2.5% | |
| 苗 | 3 | 2.5% | |
| 新 | 2 | 1.7% | |
| 型 | 2 | 1.7% | |
| 支 | 2 | 1.7% | |
| 募 | 2 | 1.7% | |
| 集 | 2 | 1.7% | |
| 圖 | 2 | 1.7% | |
| 派 | 2 | 1.7% | |
| 六 | 2 | 1.7% | |
| 合 | 2 | 1.7% | |
| 連 | 2 | 1.7% | |
| 豬 | 2 | 1.7% | |
| 藍 | 2 | 1.7% | |
| 絲 | 2 | 1.7% | |
| 相 | 2 | 1.7% | |
| 信 | 2 | 1.7% | |
| 政 | 2 | 1.7% | |
| 府 | 2 | 1.7% | |
| 接 | 2 | 1.7% | |
| Other values (53) | 60 | 49.6% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| о | 15 | 13.6% | |
| а | 8 | 7.3% | |
| с | 8 | 7.3% | |
| в | 7 | 6.4% | |
| к | 6 | 5.5% | |
| и | 6 | 5.5% | |
| д | 6 | 5.5% | |
| н | 6 | 5.5% | |
| л | 5 | 4.5% | |
| р | 4 | 3.6% | |
| у | 4 | 3.6% | |
| т | 4 | 3.6% | |
| е | 4 | 3.6% | |
| П | 3 | 2.7% | |
| м | 3 | 2.7% | |
| Р | 2 | 1.8% | |
| ж | 2 | 1.8% | |
| ч | 2 | 1.8% | |
| б | 2 | 1.8% | |
| В | 1 | 0.9% | |
| і | 1 | 0.9% | |
| С | 1 | 0.9% | |
| Я | 1 | 0.9% | |
| я | 1 | 0.9% | |
| г | 1 | 0.9% | |
| Other values (7) | 7 | 6.4% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 9 | 9.2% | |
| 소 | 5 | 5.1% | |
| 스 | 5 | 5.1% | |
| 트 | 5 | 5.1% | |
| 해 | 3 | 3.1% | |
| 방 | 3 | 3.1% | |
| 탄 | 3 | 3.1% | |
| 년 | 3 | 3.1% | |
| 단 | 3 | 3.1% | |
| 와 | 3 | 3.1% | |
| 꽃 | 2 | 2.0% | |
| 불 | 2 | 2.0% | |
| 놀 | 2 | 2.0% | |
| 야 | 2 | 2.0% | |
| 달 | 2 | 2.0% | |
| 의 | 2 | 2.0% | |
| 녀 | 2 | 2.0% | |
| 레 | 2 | 2.0% | |
| 키 | 2 | 2.0% | |
| 즈 | 2 | 2.0% | |
| 코 | 1 | 1.0% | |
| 로 | 1 | 1.0% | |
| 나 | 1 | 1.0% | |
| 그 | 1 | 1.0% | |
| 리 | 1 | 1.0% | |
| Other values (31) | 31 | 31.6% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 14 | 15.9% | |
| க | 8 | 9.1% | |
| ு | 8 | 9.1% | |
| த | 6 | 6.8% | |
| ை | 4 | 4.5% | |
| ன | 4 | 4.5% | |
| ச | 4 | 4.5% | |
| வ | 3 | 3.4% | |
| ர | 3 | 3.4% | |
| ய | 3 | 3.4% | |
| ந | 3 | 3.4% | |
| ி | 3 | 3.4% | |
| ப | 2 | 2.3% | |
| ம | 2 | 2.3% | |
| அ | 2 | 2.3% | |
| ள | 2 | 2.3% | |
| ா | 2 | 2.3% | |
| ெ | 2 | 2.3% | |
| ே | 2 | 2.3% | |
| ட | 2 | 2.3% | |
| ல | 2 | 2.3% | |
| இ | 2 | 2.3% | |
| ஞ | 1 | 1.1% | |
| ோ | 1 | 1.1% | |
| ஒ | 1 | 1.1% | |
| Other values (2) | 2 | 2.3% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ン | 4 | 8.7% | |
| コ | 3 | 6.5% | |
| ナ | 3 | 6.5% | |
| ウ | 3 | 6.5% | |
| イ | 3 | 6.5% | |
| ル | 3 | 6.5% | |
| ク | 3 | 6.5% | |
| ロ | 2 | 4.3% | |
| ス | 2 | 4.3% | |
| ワ | 2 | 4.3% | |
| チ | 2 | 4.3% | |
| ト | 2 | 4.3% | |
| モ | 1 | 2.2% | |
| デ | 1 | 2.2% | |
| レ | 1 | 2.2% | |
| ジ | 1 | 2.2% | |
| ャ | 1 | 2.2% | |
| ゲ | 1 | 2.2% | |
| ム | 1 | 2.2% | |
| ラ | 1 | 2.2% | |
| マ | 1 | 2.2% | |
| ア | 1 | 2.2% | |
| オ | 1 | 2.2% | |
| リ | 1 | 2.2% | |
| ピ | 1 | 2.2% |
Most frequent Sinhala characters
| Value | Count | Frequency (%) | |
| ් | 4 | 16.0% | |
| න | 3 | 12.0% | |
| ෙ | 3 | 12.0% | |
| ක | 2 | 8.0% | |
| ල | 2 | 8.0% | |
| ා | 2 | 8.0% | |
| ස | 2 | 8.0% | |
| බ | 1 | 4.0% | |
| ෑ | 1 | 4.0% | |
| ම | 1 | 4.0% | |
| ය | 1 | 4.0% | |
| ි | 1 | 4.0% | |
| ප | 1 | 4.0% | |
| ඩ | 1 | 4.0% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| ο | 14 | 17.7% | |
| μ | 10 | 12.7% | |
| ι | 6 | 7.6% | |
| α | 5 | 6.3% | |
| ς | 4 | 5.1% | |
| ε | 4 | 5.1% | |
| β | 4 | 5.1% | |
| λ | 4 | 5.1% | |
| σ | 4 | 5.1% | |
| π | 2 | 2.5% | |
| κ | 2 | 2.5% | |
| ρ | 2 | 2.5% | |
| ν | 2 | 2.5% | |
| Δ | 2 | 2.5% | |
| Μ | 2 | 2.5% | |
| Ε | 2 | 2.5% | |
| υ | 2 | 2.5% | |
| τ | 2 | 2.5% | |
| ω | 1 | 1.3% | |
| Θ | 1 | 1.3% | |
| έ | 1 | 1.3% | |
| Σ | 1 | 1.3% | |
| ί | 1 | 1.3% | |
| Ν | 1 | 1.3% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ว | 17 | 9.7% | |
| ค | 15 | 8.5% | |
| น | 13 | 7.4% | |
| ี | 10 | 5.7% | |
| ิ | 10 | 5.7% | |
| ั | 9 | 5.1% | |
| โ | 9 | 5.1% | |
| ่ | 8 | 4.5% | |
| ด | 8 | 4.5% | |
| ซ | 5 | 2.8% | |
| อ | 5 | 2.8% | |
| ร | 5 | 2.8% | |
| า | 5 | 2.8% | |
| ้ | 4 | 2.3% | |
| ก | 4 | 2.3% | |
| ม | 4 | 2.3% | |
| จ | 4 | 2.3% | |
| ย | 3 | 1.7% | |
| ู | 3 | 1.7% | |
| ต | 3 | 1.7% | |
| ไ | 3 | 1.7% | |
| เ | 3 | 1.7% | |
| ง | 2 | 1.1% | |
| แ | 2 | 1.1% | |
| ป | 2 | 1.1% | |
| Other values (15) | 20 | 11.4% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| ် | 8 | 11.9% | |
| ု | 6 | 9.0% | |
| ိ | 6 | 9.0% | |
| င | 5 | 7.5% | |
| က | 5 | 7.5% | |
| န | 4 | 6.0% | |
| ာ | 4 | 6.0% | |
| ရ | 3 | 4.5% | |
| တ | 2 | 3.0% | |
| ံ | 2 | 3.0% | |
| ွ | 2 | 3.0% | |
| ဆ | 2 | 3.0% | |
| ေ | 2 | 3.0% | |
| း | 2 | 3.0% | |
| မ | 2 | 3.0% | |
| ီ | 2 | 3.0% | |
| ၏ | 1 | 1.5% | |
| ဗ | 1 | 1.5% | |
| စ | 1 | 1.5% | |
| ယ | 1 | 1.5% | |
| ျ | 1 | 1.5% | |
| ဒ | 1 | 1.5% | |
| သ | 1 | 1.5% | |
| ့ | 1 | 1.5% | |
| ှ | 1 | 1.5% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ま | 2 | 10.0% | |
| え | 2 | 10.0% | |
| ら | 2 | 10.0% | |
| の | 2 | 10.0% | |
| お | 1 | 5.0% | |
| げ | 1 | 5.0% | |
| て | 1 | 5.0% | |
| け | 1 | 5.0% | |
| た | 1 | 5.0% | |
| に | 1 | 5.0% | |
| は | 1 | 5.0% | |
| し | 1 | 5.0% | |
| い | 1 | 5.0% | |
| こ | 1 | 5.0% | |
| と | 1 | 5.0% | |
| よ | 1 | 5.0% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| క | 4 | 17.4% | |
| ా | 3 | 13.0% | |
| ి | 2 | 8.7% | |
| ్ | 2 | 8.7% | |
| అ | 1 | 4.3% | |
| మ | 1 | 4.3% | |
| ె | 1 | 4.3% | |
| ర | 1 | 4.3% | |
| ు | 1 | 4.3% | |
| ొ | 1 | 4.3% | |
| వ | 1 | 4.3% | |
| గ | 1 | 4.3% | |
| జ | 1 | 4.3% | |
| న | 1 | 4.3% | |
| ట | 1 | 4.3% | |
| ీ | 1 | 4.3% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ರ | 3 | 16.7% | |
| ಿ | 2 | 11.1% | |
| ೆ | 2 | 11.1% | |
| ್ | 2 | 11.1% | |
| ಲ | 1 | 5.6% | |
| ಸ | 1 | 5.6% | |
| ಕ | 1 | 5.6% | |
| ಖ | 1 | 5.6% | |
| ೀ | 1 | 5.6% | |
| ದ | 1 | 5.6% | |
| ಗ | 1 | 5.6% | |
| ಆ | 1 | 5.6% | |
| ಡ | 1 | 5.6% |
Most frequent Canadian_Aboriginal characters
| Value | Count | Frequency (%) | |
| ᗩ | 1 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5749282 | 99.0% | |
| Punctuation | 40578 | 0.7% | |
| None | 7590 | 0.1% | |
| Enclosed Alphanum Sup | 2783 | < 0.1% | |
| Emoticons | 2314 | < 0.1% | |
| VS | 1138 | < 0.1% | |
| Dingbats | 920 | < 0.1% | |
| Math Alphanum | 856 | < 0.1% | |
| Arabic | 641 | < 0.1% | |
| Latin 1 Sup | 625 | < 0.1% | |
| Devanagari | 552 | < 0.1% | |
| Misc Symbols | 396 | < 0.1% | |
| Phonetic Ext | 319 | < 0.1% | |
| Thai | 176 | < 0.1% | |
| Katakana | 174 | < 0.1% | |
| IPA Ext | 135 | < 0.1% | |
| CJK | 121 | < 0.1% | |
| Cyrillic | 110 | < 0.1% | |
| Hangul | 98 | < 0.1% | |
| Tamil | 88 | < 0.1% | |
| Myanmar | 67 | < 0.1% | |
| Latin Ext A | 60 | < 0.1% | |
| Currency Symbols | 37 | < 0.1% | |
| Geometric Shapes | 34 | < 0.1% | |
| Sinhala | 25 | < 0.1% | |
| Other values (20) | 171 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 721198 | 12.5% | ||
| e | 390068 | 6.8% | |
| t | 386085 | 6.7% | |
| a | 323525 | 5.6% | |
| o | 323158 | 5.6% | |
| i | 292141 | 5.1% | |
| n | 274557 | 4.8% | |
| s | 251218 | 4.4% | |
| c | 209618 | 3.6% | |
| r | 208095 | 3.6% | |
| h | 181481 | 3.2% | |
| d | 145507 | 2.5% | |
| / | 128374 | 2.2% | |
| l | 112396 | 2.0% | |
| p | 111586 | 1.9% | |
| u | 90895 | 1.6% | |
| # | 86389 | 1.5% | |
| f | 83761 | 1.5% | |
| v | 83527 | 1.5% | |
| m | 77430 | 1.3% | |
| . | 73324 | 1.3% | |
| y | 68142 | 1.2% | |
| g | 63501 | 1.1% | |
| C | 47192 | 0.8% | |
| w | 47051 | 0.8% | |
| Other values (70) | 969063 | 16.9% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| … | 32434 | 79.9% | |
| ’ | 5508 | 13.6% | |
| “ | 720 | 1.8% | |
| ” | 514 | 1.3% | |
| ‘ | 295 | 0.7% | |
| — | 219 | 0.5% | |
| | 163 | 0.4% | |
| | 143 | 0.4% | |
| ‼ | 133 | 0.3% | |
| | 131 | 0.3% | |
| – | 122 | 0.3% | |
| | 87 | 0.2% | |
| • | 75 | 0.2% | |
| | 7 | < 0.1% | |
| „ | 6 | < 0.1% | |
| ⁉ | 5 | < 0.1% | |
| 4 | < 0.1% | ||
| 4 | < 0.1% | ||
| ‑ | 2 | < 0.1% | |
| › | 2 | < 0.1% | |
| † | 1 | < 0.1% | |
| ․ | 1 | < 0.1% | |
| | 1 | < 0.1% | |
| | 1 | < 0.1% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| 💉 | 1727 | 22.8% | |
| 🏻 | 317 | 4.2% | |
| 👏 | 299 | 3.9% | |
| 💪 | 299 | 3.9% | |
| 👍 | 276 | 3.6% | |
| 👇 | 252 | 3.3% | |
| 🏼 | 243 | 3.2% | |
| 🤣 | 176 | 2.3% | |
| 🦠 | 167 | 2.2% | |
| 🏽 | 155 | 2.0% | |
| 🚀 | 155 | 2.0% | |
| 🤔 | 143 | 1.9% | |
| ⃣ | 128 | 1.7% | |
| 🥳 | 122 | 1.6% | |
| 🏾 | 115 | 1.5% | |
| 🎉 | 111 | 1.5% | |
| 🌎 | 90 | 1.2% | |
| 👉 | 80 | 1.1% | |
| 🥰 | 71 | 0.9% | |
| 🔥 | 69 | 0.9% | |
| 💙 | 59 | 0.8% | |
| 👌 | 55 | 0.7% | |
| 🚨 | 54 | 0.7% | |
| 🤷 | 50 | 0.7% | |
| 💥 | 49 | 0.6% | |
| Other values (389) | 2328 | 30.7% |
Most frequent Misc Symbols characters
| Value | Count | Frequency (%) | |
| ♂ | 58 | 14.6% | |
| ♀ | 50 | 12.6% | |
| ☺ | 47 | 11.9% | |
| ⚕ | 31 | 7.8% | |
| ♥ | 30 | 7.6% | |
| ☑ | 26 | 6.6% | |
| ☝ | 25 | 6.3% | |
| ☀ | 22 | 5.6% | |
| ⚠ | 22 | 5.6% | |
| ☠ | 21 | 5.3% | |
| ⚡ | 9 | 2.3% | |
| ⛔ | 7 | 1.8% | |
| ☹ | 5 | 1.3% | |
| ⚰ | 5 | 1.3% | |
| ☕ | 4 | 1.0% | |
| ☣ | 4 | 1.0% | |
| ⚖ | 3 | 0.8% | |
| ☎ | 3 | 0.8% | |
| ☔ | 3 | 0.8% | |
| ☃ | 2 | 0.5% | |
| ⚽ | 2 | 0.5% | |
| ☄ | 2 | 0.5% | |
| ☪ | 1 | 0.3% | |
| ⛄ | 1 | 0.3% | |
| ⚾ | 1 | 0.3% | |
| Other values (12) | 12 | 3.0% |
Most frequent VS characters
| Value | Count | Frequency (%) | |
| ️ | 1137 | 99.9% | |
| ︎ | 1 | 0.1% |
Most frequent Emoticons characters
| Value | Count | Frequency (%) | |
| 🙏 | 438 | 18.9% | |
| 😂 | 319 | 13.8% | |
| 🙌 | 169 | 7.3% | |
| 😷 | 168 | 7.3% | |
| 😊 | 138 | 6.0% | |
| 😁 | 127 | 5.5% | |
| 😎 | 114 | 4.9% | |
| 😭 | 75 | 3.2% | |
| 😍 | 49 | 2.1% | |
| 😉 | 45 | 1.9% | |
| 😅 | 41 | 1.8% | |
| 😳 | 41 | 1.8% | |
| 😃 | 39 | 1.7% | |
| 😜 | 39 | 1.7% | |
| 😀 | 37 | 1.6% | |
| 🙂 | 37 | 1.6% | |
| 🙄 | 31 | 1.3% | |
| 😆 | 27 | 1.2% | |
| 😱 | 24 | 1.0% | |
| 😇 | 23 | 1.0% | |
| 😩 | 22 | 1.0% | |
| 😢 | 22 | 1.0% | |
| 😌 | 18 | 0.8% | |
| 😬 | 17 | 0.7% | |
| 😡 | 16 | 0.7% | |
| Other values (44) | 238 | 10.3% |
Most frequent Enclosed Alphanum Sup characters
| Value | Count | Frequency (%) | |
| 🇳 | 390 | 14.0% | |
| 🇺 | 319 | 11.5% | |
| 🇨 | 265 | 9.5% | |
| 🇮 | 210 | 7.5% | |
| 🇷 | 178 | 6.4% | |
| 🇸 | 157 | 5.6% | |
| 🇪 | 148 | 5.3% | |
| 🇦 | 147 | 5.3% | |
| 🇬 | 132 | 4.7% | |
| 🇧 | 122 | 4.4% | |
| 🇵 | 105 | 3.8% | |
| 🇭 | 95 | 3.4% | |
| 🇰 | 90 | 3.2% | |
| 🇹 | 76 | 2.7% | |
| 🇲 | 64 | 2.3% | |
| 🇱 | 50 | 1.8% | |
| 🇿 | 43 | 1.5% | |
| 🇩 | 41 | 1.5% | |
| 🇾 | 34 | 1.2% | |
| 🇴 | 26 | 0.9% | |
| 🇽 | 17 | 0.6% | |
| 🇼 | 14 | 0.5% | |
| 🇫 | 13 | 0.5% | |
| 🇶 | 13 | 0.5% | |
| 🇻 | 13 | 0.5% | |
| Other values (6) | 21 | 0.8% |
Most frequent Currency Symbols characters
| Value | Count | Frequency (%) | |
| € | 21 | 56.8% | |
| ₹ | 16 | 43.2% |
Most frequent Dingbats characters
| Value | Count | Frequency (%) | |
| ✅ | 305 | 33.2% | |
| ❤ | 235 | 25.5% | |
| ✔ | 111 | 12.1% | |
| ✌ | 88 | 9.6% | |
| ✈ | 43 | 4.7% | |
| ✨ | 36 | 3.9% | |
| ➡ | 27 | 2.9% | |
| ❗ | 15 | 1.6% | |
| ➖ | 12 | 1.3% | |
| ❓ | 8 | 0.9% | |
| ❄ | 5 | 0.5% | |
| ✍ | 5 | 0.5% | |
| ❣ | 4 | 0.4% | |
| ❌ | 4 | 0.4% | |
| ✊ | 3 | 0.3% | |
| ➕ | 3 | 0.3% | |
| ✝ | 3 | 0.3% | |
| ✋ | 3 | 0.3% | |
| ❝ | 2 | 0.2% | |
| ❶ | 2 | 0.2% | |
| ➌ | 1 | 0.1% | |
| ➎ | 1 | 0.1% | |
| ❷ | 1 | 0.1% | |
| ❇ | 1 | 0.1% | |
| ✉ | 1 | 0.1% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 121 | 18.9% | |
| ر | 70 | 10.9% | |
| ل | 54 | 8.4% | |
| و | 40 | 6.2% | |
| ن | 38 | 5.9% | |
| ز | 36 | 5.6% | |
| ج | 34 | 5.3% | |
| ی | 25 | 3.9% | |
| ي | 23 | 3.6% | |
| م | 21 | 3.3% | |
| ک | 16 | 2.5% | |
| ئ | 15 | 2.3% | |
| ب | 14 | 2.2% | |
| د | 14 | 2.2% | |
| ت | 10 | 1.6% | |
| ع | 9 | 1.4% | |
| خ | 8 | 1.2% | |
| ہ | 8 | 1.2% | |
| س | 7 | 1.1% | |
| ك | 7 | 1.1% | |
| ط | 7 | 1.1% | |
| ة | 6 | 0.9% | |
| ے | 6 | 0.9% | |
| ح | 5 | 0.8% | |
| ص | 5 | 0.8% | |
| Other values (17) | 42 | 6.6% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| 215 | 34.4% | ||
| Ê | 90 | 14.4% | |
| í | 48 | 7.7% | |
| é | 33 | 5.3% | |
| ó | 27 | 4.3% | |
| ° | 24 | 3.8% | |
| á | 23 | 3.7% | |
| £ | 21 | 3.4% | |
| ® | 17 | 2.7% | |
| ü | 15 | 2.4% | |
| « | 11 | 1.8% | |
| » | 10 | 1.6% | |
| ñ | 10 | 1.6% | |
| ´ | 10 | 1.6% | |
| º | 7 | 1.1% | |
| · | 6 | 1.0% | |
| ö | 5 | 0.8% | |
| ¡ | 5 | 0.8% | |
| ú | 4 | 0.6% | |
| © | 4 | 0.6% | |
| ã | 4 | 0.6% | |
| è | 3 | 0.5% | |
| ô | 3 | 0.5% | |
| Ö | 3 | 0.5% | |
| ä | 3 | 0.5% | |
| Other values (16) | 24 | 3.8% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| र | 83 | 15.0% | |
| द | 45 | 8.2% | |
| े | 37 | 6.7% | |
| न | 34 | 6.2% | |
| ी | 32 | 5.8% | |
| ा | 30 | 5.4% | |
| ् | 29 | 5.3% | |
| ं | 25 | 4.5% | |
| ि | 24 | 4.3% | |
| म | 24 | 4.3% | |
| ो | 24 | 4.3% | |
| ह | 18 | 3.3% | |
| क | 14 | 2.5% | |
| ब | 14 | 2.5% | |
| ज | 12 | 2.2% | |
| त | 12 | 2.2% | |
| स | 10 | 1.8% | |
| ग | 9 | 1.6% | |
| व | 8 | 1.4% | |
| ए | 8 | 1.4% | |
| ल | 8 | 1.4% | |
| प | 6 | 1.1% | |
| औ | 6 | 1.1% | |
| भ | 5 | 0.9% | |
| य | 4 | 0.7% | |
| Other values (16) | 31 | 5.6% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ー | 124 | 71.3% | |
| ン | 4 | 2.3% | |
| ・ | 4 | 2.3% | |
| コ | 3 | 1.7% | |
| ナ | 3 | 1.7% | |
| ウ | 3 | 1.7% | |
| イ | 3 | 1.7% | |
| ル | 3 | 1.7% | |
| ク | 3 | 1.7% | |
| ロ | 2 | 1.1% | |
| ス | 2 | 1.1% | |
| ワ | 2 | 1.1% | |
| チ | 2 | 1.1% | |
| ト | 2 | 1.1% | |
| モ | 1 | 0.6% | |
| デ | 1 | 0.6% | |
| レ | 1 | 0.6% | |
| ジ | 1 | 0.6% | |
| ャ | 1 | 0.6% | |
| ゲ | 1 | 0.6% | |
| ム | 1 | 0.6% | |
| ラ | 1 | 0.6% | |
| マ | 1 | 0.6% | |
| ア | 1 | 0.6% | |
| オ | 1 | 0.6% | |
| Other values (3) | 3 | 1.7% |
Most frequent Geometric Shapes characters
| Value | Count | Frequency (%) | |
| ▪ | 12 | 35.3% | |
| ● | 8 | 23.5% | |
| ▶ | 6 | 17.6% | |
| ◆ | 3 | 8.8% | |
| ► | 2 | 5.9% | |
| ▫ | 2 | 5.9% | |
| ■ | 1 | 2.9% |
Most frequent Latin Ext A characters
| Value | Count | Frequency (%) | |
| ğ | 11 | 18.3% | |
| š | 9 | 15.0% | |
| Ş | 8 | 13.3% | |
| č | 8 | 13.3% | |
| ı | 7 | 11.7% | |
| ć | 6 | 10.0% | |
| ş | 4 | 6.7% | |
| İ | 3 | 5.0% | |
| ď | 1 | 1.7% | |
| ā | 1 | 1.7% | |
| ē | 1 | 1.7% | |
| ą | 1 | 1.7% |
Most frequent Letterlike Symbols characters
| Value | Count | Frequency (%) | |
| ℹ | 2 | 33.3% | |
| ™ | 2 | 33.3% | |
| ℅ | 1 | 16.7% | |
| № | 1 | 16.7% |
Most frequent Diacriticals characters
| Value | Count | Frequency (%) | |
| ̇ | 7 | 38.9% | |
| ̶ | 7 | 38.9% | |
| ͟ | 4 | 22.2% |
Most frequent Arrows characters
| Value | Count | Frequency (%) | |
| → | 6 | 75.0% | |
| ↗ | 1 | 12.5% | |
| ↬ | 1 | 12.5% |
Most frequent Braille characters
| Value | Count | Frequency (%) | |
| ⠀ | 5 | 100.0% |
Most frequent Sup Arrows B characters
| Value | Count | Frequency (%) | |
| ⤵ | 7 | 77.8% | |
| ⤴ | 2 | 22.2% |
Most frequent Number Forms characters
| Value | Count | Frequency (%) | |
| ⅓ | 1 | 100.0% |
Most frequent Geometric Shapes Ext characters
| Value | Count | Frequency (%) | |
| 🟢 | 4 | 57.1% | |
| 🟣 | 1 | 14.3% | |
| 🟠 | 1 | 14.3% | |
| 🟩 | 1 | 14.3% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 疫 | 4 | 3.3% | |
| 科 | 4 | 3.3% | |
| 興 | 4 | 3.3% | |
| 彩 | 4 | 3.3% | |
| 来 | 3 | 2.5% | |
| 自 | 3 | 2.5% | |
| 苗 | 3 | 2.5% | |
| 新 | 2 | 1.7% | |
| 型 | 2 | 1.7% | |
| 支 | 2 | 1.7% | |
| 募 | 2 | 1.7% | |
| 集 | 2 | 1.7% | |
| 圖 | 2 | 1.7% | |
| 派 | 2 | 1.7% | |
| 六 | 2 | 1.7% | |
| 合 | 2 | 1.7% | |
| 連 | 2 | 1.7% | |
| 豬 | 2 | 1.7% | |
| 藍 | 2 | 1.7% | |
| 絲 | 2 | 1.7% | |
| 相 | 2 | 1.7% | |
| 信 | 2 | 1.7% | |
| 政 | 2 | 1.7% | |
| 府 | 2 | 1.7% | |
| 接 | 2 | 1.7% | |
| Other values (53) | 60 | 49.6% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| о | 15 | 13.6% | |
| а | 8 | 7.3% | |
| с | 8 | 7.3% | |
| в | 7 | 6.4% | |
| к | 6 | 5.5% | |
| и | 6 | 5.5% | |
| д | 6 | 5.5% | |
| н | 6 | 5.5% | |
| л | 5 | 4.5% | |
| р | 4 | 3.6% | |
| у | 4 | 3.6% | |
| т | 4 | 3.6% | |
| е | 4 | 3.6% | |
| П | 3 | 2.7% | |
| м | 3 | 2.7% | |
| Р | 2 | 1.8% | |
| ж | 2 | 1.8% | |
| ч | 2 | 1.8% | |
| б | 2 | 1.8% | |
| В | 1 | 0.9% | |
| і | 1 | 0.9% | |
| С | 1 | 0.9% | |
| Я | 1 | 0.9% | |
| я | 1 | 0.9% | |
| г | 1 | 0.9% | |
| Other values (7) | 7 | 6.4% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 9 | 9.2% | |
| 소 | 5 | 5.1% | |
| 스 | 5 | 5.1% | |
| 트 | 5 | 5.1% | |
| 해 | 3 | 3.1% | |
| 방 | 3 | 3.1% | |
| 탄 | 3 | 3.1% | |
| 년 | 3 | 3.1% | |
| 단 | 3 | 3.1% | |
| 와 | 3 | 3.1% | |
| 꽃 | 2 | 2.0% | |
| 불 | 2 | 2.0% | |
| 놀 | 2 | 2.0% | |
| 야 | 2 | 2.0% | |
| 달 | 2 | 2.0% | |
| 의 | 2 | 2.0% | |
| 녀 | 2 | 2.0% | |
| 레 | 2 | 2.0% | |
| 키 | 2 | 2.0% | |
| 즈 | 2 | 2.0% | |
| 코 | 1 | 1.0% | |
| 로 | 1 | 1.0% | |
| 나 | 1 | 1.0% | |
| 그 | 1 | 1.0% | |
| 리 | 1 | 1.0% | |
| Other values (31) | 31 | 31.6% |
Most frequent Phonetic Ext characters
| Value | Count | Frequency (%) | |
| ᴠ | 87 | 27.3% | |
| ᴇ | 86 | 27.0% | |
| ᴀ | 65 | 20.4% | |
| ᴄ | 65 | 20.4% | |
| ᴛ | 6 | 1.9% | |
| ᴅ | 5 | 1.6% | |
| ᴍ | 3 | 0.9% | |
| ᴏ | 2 | 0.6% |
Most frequent IPA Ext characters
| Value | Count | Frequency (%) | |
| ɪ | 64 | 47.4% | |
| ɴ | 33 | 24.4% | |
| ʟ | 27 | 20.0% | |
| ə | 4 | 3.0% | |
| ɡ | 2 | 1.5% | |
| ʌ | 2 | 1.5% | |
| ʘ | 2 | 1.5% | |
| ɢ | 1 | 0.7% |
Most frequent Math Alphanum characters
| Value | Count | Frequency (%) | |
| 𝙖 | 37 | 4.3% | |
| 𝙣 | 32 | 3.7% | |
| 𝙮 | 25 | 2.9% | |
| 𝗮 | 24 | 2.8% | |
| 𝗼 | 23 | 2.7% | |
| 𝗲 | 22 | 2.6% | |
| 𝙚 | 22 | 2.6% | |
| 𝙞 | 22 | 2.6% | |
| 𝙤 | 21 | 2.5% | |
| 𝙙 | 21 | 2.5% | |
| 𝙧 | 19 | 2.2% | |
| 𝗶 | 18 | 2.1% | |
| 𝗻 | 18 | 2.1% | |
| 𝘀 | 17 | 2.0% | |
| 𝙢 | 16 | 1.9% | |
| 𝙏 | 16 | 1.9% | |
| 𝘁 | 15 | 1.8% | |
| 𝗿 | 15 | 1.8% | |
| 𝐢 | 14 | 1.6% | |
| 𝗰 | 13 | 1.5% | |
| 𝙈 | 12 | 1.4% | |
| 𝐞 | 12 | 1.4% | |
| 𝐚 | 12 | 1.4% | |
| 𝗜 | 11 | 1.3% | |
| 𝗔 | 11 | 1.3% | |
| Other values (131) | 388 | 45.3% |
Most frequent Misc Technical characters
| Value | Count | Frequency (%) | |
| ⏳ | 12 | 75.0% | |
| ⏰ | 2 | 12.5% | |
| ⌛ | 1 | 6.2% | |
| ⏬ | 1 | 6.2% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| ் | 14 | 15.9% | |
| க | 8 | 9.1% | |
| ு | 8 | 9.1% | |
| த | 6 | 6.8% | |
| ை | 4 | 4.5% | |
| ன | 4 | 4.5% | |
| ச | 4 | 4.5% | |
| வ | 3 | 3.4% | |
| ர | 3 | 3.4% | |
| ய | 3 | 3.4% | |
| ந | 3 | 3.4% | |
| ி | 3 | 3.4% | |
| ப | 2 | 2.3% | |
| ம | 2 | 2.3% | |
| அ | 2 | 2.3% | |
| ள | 2 | 2.3% | |
| ா | 2 | 2.3% | |
| ெ | 2 | 2.3% | |
| ே | 2 | 2.3% | |
| ட | 2 | 2.3% | |
| ல | 2 | 2.3% | |
| இ | 2 | 2.3% | |
| ஞ | 1 | 1.1% | |
| ோ | 1 | 1.1% | |
| ஒ | 1 | 1.1% | |
| Other values (2) | 2 | 2.3% |
Most frequent Sinhala characters
| Value | Count | Frequency (%) | |
| ් | 4 | 16.0% | |
| න | 3 | 12.0% | |
| ෙ | 3 | 12.0% | |
| ක | 2 | 8.0% | |
| ල | 2 | 8.0% | |
| ා | 2 | 8.0% | |
| ස | 2 | 8.0% | |
| බ | 1 | 4.0% | |
| ෑ | 1 | 4.0% | |
| ම | 1 | 4.0% | |
| ය | 1 | 4.0% | |
| ි | 1 | 4.0% | |
| ප | 1 | 4.0% | |
| ඩ | 1 | 4.0% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ว | 17 | 9.7% | |
| ค | 15 | 8.5% | |
| น | 13 | 7.4% | |
| ี | 10 | 5.7% | |
| ิ | 10 | 5.7% | |
| ั | 9 | 5.1% | |
| โ | 9 | 5.1% | |
| ่ | 8 | 4.5% | |
| ด | 8 | 4.5% | |
| ซ | 5 | 2.8% | |
| อ | 5 | 2.8% | |
| ร | 5 | 2.8% | |
| า | 5 | 2.8% | |
| ้ | 4 | 2.3% | |
| ก | 4 | 2.3% | |
| ม | 4 | 2.3% | |
| จ | 4 | 2.3% | |
| ย | 3 | 1.7% | |
| ู | 3 | 1.7% | |
| ต | 3 | 1.7% | |
| ไ | 3 | 1.7% | |
| เ | 3 | 1.7% | |
| ง | 2 | 1.1% | |
| แ | 2 | 1.1% | |
| ป | 2 | 1.1% | |
| Other values (15) | 20 | 11.4% |
Most frequent Tags characters
| Value | Count | Frequency (%) | |
| | 2 | 16.7% | |
| | 2 | 16.7% | |
| | 2 | 16.7% | |
| | 2 | 16.7% | |
| | 2 | 16.7% | |
| | 2 | 16.7% |
Most frequent Math Operators characters
| Value | Count | Frequency (%) | |
| ≥ | 2 | 66.7% | |
| ≈ | 1 | 33.3% |
Most frequent Modifier Letters characters
| Value | Count | Frequency (%) | |
| ˈ | 7 | 87.5% | |
| ˌ | 1 | 12.5% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| ် | 8 | 11.9% | |
| ု | 6 | 9.0% | |
| ိ | 6 | 9.0% | |
| င | 5 | 7.5% | |
| က | 5 | 7.5% | |
| န | 4 | 6.0% | |
| ာ | 4 | 6.0% | |
| ရ | 3 | 4.5% | |
| တ | 2 | 3.0% | |
| ံ | 2 | 3.0% | |
| ွ | 2 | 3.0% | |
| ဆ | 2 | 3.0% | |
| ေ | 2 | 3.0% | |
| း | 2 | 3.0% | |
| မ | 2 | 3.0% | |
| ီ | 2 | 3.0% | |
| ၏ | 1 | 1.5% | |
| ဗ | 1 | 1.5% | |
| စ | 1 | 1.5% | |
| ယ | 1 | 1.5% | |
| ျ | 1 | 1.5% | |
| ဒ | 1 | 1.5% | |
| သ | 1 | 1.5% | |
| ့ | 1 | 1.5% | |
| ှ | 1 | 1.5% |
Most frequent Enclosed Alphanum characters
| Value | Count | Frequency (%) | |
| ⓿ | 1 | 100.0% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ま | 2 | 10.0% | |
| え | 2 | 10.0% | |
| ら | 2 | 10.0% | |
| の | 2 | 10.0% | |
| お | 1 | 5.0% | |
| げ | 1 | 5.0% | |
| て | 1 | 5.0% | |
| け | 1 | 5.0% | |
| た | 1 | 5.0% | |
| に | 1 | 5.0% | |
| は | 1 | 5.0% | |
| し | 1 | 5.0% | |
| い | 1 | 5.0% | |
| こ | 1 | 5.0% | |
| と | 1 | 5.0% | |
| よ | 1 | 5.0% |
Most frequent Block Elements characters
| Value | Count | Frequency (%) | |
| █ | 6 | 50.0% | |
| ▒ | 6 | 50.0% |
Most frequent Telugu characters
| Value | Count | Frequency (%) | |
| క | 4 | 17.4% | |
| ా | 3 | 13.0% | |
| ి | 2 | 8.7% | |
| ్ | 2 | 8.7% | |
| అ | 1 | 4.3% | |
| మ | 1 | 4.3% | |
| ె | 1 | 4.3% | |
| ర | 1 | 4.3% | |
| ు | 1 | 4.3% | |
| ొ | 1 | 4.3% | |
| వ | 1 | 4.3% | |
| గ | 1 | 4.3% | |
| జ | 1 | 4.3% | |
| న | 1 | 4.3% | |
| ట | 1 | 4.3% | |
| ీ | 1 | 4.3% |
Most frequent Kannada characters
| Value | Count | Frequency (%) | |
| ರ | 3 | 16.7% | |
| ಿ | 2 | 11.1% | |
| ೆ | 2 | 11.1% | |
| ್ | 2 | 11.1% | |
| ಲ | 1 | 5.6% | |
| ಸ | 1 | 5.6% | |
| ಕ | 1 | 5.6% | |
| ಖ | 1 | 5.6% | |
| ೀ | 1 | 5.6% | |
| ದ | 1 | 5.6% | |
| ಗ | 1 | 5.6% | |
| ಆ | 1 | 5.6% | |
| ಡ | 1 | 5.6% |
Most frequent Arabic PF B characters
| Value | Count | Frequency (%) | |
| ﻌ | 1 | 100.0% |
Most frequent Box Drawing characters
| Value | Count | Frequency (%) | |
| ─ | 1 | 100.0% |
Most frequent UCAS characters
| Value | Count | Frequency (%) | |
| ᗩ | 1 | 100.0% |
Most frequent Alphabetic PF characters
| Value | Count | Frequency (%) | |
| ffi | 1 | 100.0% |
| Distinct | 16835 |
|---|---|
| Distinct (%) | 46.5% |
| Missing | 9816 |
| Missing (%) | 21.3% |
| Memory size | 360.0 KiB |
| ['Moderna'] | 2160 |
|---|---|
| ['Covaxin'] | 1705 |
| ['SputnikV'] | 1647 |
| ['PfizerBioNTech'] | 853 |
| ['OxfordAstraZeneca'] | 606 |
| Other values (16830) |
| Value | Count | Frequency (%) | |
| ['Moderna'] | 2160 | 4.7% | |
| ['Covaxin'] | 1705 | 3.7% | |
| ['SputnikV'] | 1647 | 3.6% | |
| ['PfizerBioNTech'] | 853 | 1.9% | |
| ['OxfordAstraZeneca'] | 606 | 1.3% | |
| ['COVID19'] | 586 | 1.3% | |
| ['moderna'] | 521 | 1.1% | |
| ['Sinopharm'] | 401 | 0.9% | |
| ['Sinovac'] | 401 | 0.9% | |
| ['COVAXIN'] | 363 | 0.8% | |
| ['covaxin'] | 246 | 0.5% | |
| ['oxfordastrazeneca'] | 230 | 0.5% | |
| ['Pfizer', 'Moderna'] | 187 | 0.4% | |
| ['PfizerBiontech'] | 166 | 0.4% | |
| ['vaccine'] | 145 | 0.3% | |
| ['Moderna', 'vaccine'] | 142 | 0.3% | |
| ['CovidVaccine'] | 128 | 0.3% | |
| ['AstraZeneca'] | 115 | 0.2% | |
| ['Moderna', 'CovidVaccine'] | 105 | 0.2% | |
| ['Covishield', 'Covaxin'] | 103 | 0.2% | |
| ['Moderna', 'COVID19'] | 103 | 0.2% | |
| ['CovidVaccine', 'Moderna'] | 91 | 0.2% | |
| ['PfizerBioNTech', 'CovidVaccine'] | 87 | 0.2% | |
| ['Pfizer'] | 81 | 0.2% | |
| ['COVID19', 'vaccine'] | 79 | 0.2% | |
| Other values (16810) | 24992 | 54.3% | |
| (Missing) | 9816 | 21.3% |
Frequencies of value counts
Unique
| Unique | 14702 ? |
|---|---|
| Unique (%) | 40.6% |
Histogram of lengths of the category
Length
| Max length | 142 |
|---|---|
| Median length | 21 |
| Mean length | 24.66050066 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| ' | 171636 | 15.1% | |
| n | 81479 | 7.2% | |
| a | 78287 | 6.9% | |
| i | 66236 | 5.8% | |
| e | 60138 | 5.3% | |
| o | 50948 | 4.5% | |
| , | 49575 | 4.4% | |
| 49575 | 4.4% | ||
| c | 47906 | 4.2% | |
| r | 40213 | 3.5% | |
| [ | 36243 | 3.2% | |
| ] | 36243 | 3.2% | |
| d | 25966 | 2.3% | |
| t | 23752 | 2.1% | |
| v | 23597 | 2.1% | |
| C | 21421 | 1.9% | |
| s | 21221 | 1.9% | |
| V | 20446 | 1.8% | |
| h | 15203 | 1.3% | |
| u | 12215 | 1.1% | |
| S | 11469 | 1.0% | |
| f | 11291 | 1.0% | |
| I | 11279 | 1.0% | |
| O | 11270 | 1.0% | |
| M | 10323 | 0.9% | |
| Other values (417) | 147906 | 13.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 622917 | 54.8% | |
| Other Punctuation | 221211 | 19.5% | |
| Uppercase Letter | 149700 | 13.2% | |
| Space Separator | 49575 | 4.4% | |
| Open Punctuation | 36243 | 3.2% | |
| Close Punctuation | 36243 | 3.2% | |
| Decimal Number | 17916 | 1.6% | |
| Other Letter | 1114 | 0.1% | |
| Connector Punctuation | 646 | 0.1% | |
| Modifier Letter | 123 | < 0.1% | |
| Nonspacing Mark | 98 | < 0.1% | |
| Spacing Mark | 52 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| [ | 36243 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 171636 | 77.6% | |
| , | 49575 | 22.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 21421 | 14.3% | |
| V | 20446 | 13.7% | |
| S | 11469 | 7.7% | |
| I | 11279 | 7.5% | |
| O | 11270 | 7.5% | |
| M | 10323 | 6.9% | |
| D | 9377 | 6.3% | |
| P | 8929 | 6.0% | |
| N | 7162 | 4.8% | |
| A | 7097 | 4.7% | |
| B | 6867 | 4.6% | |
| T | 5396 | 3.6% | |
| Z | 2979 | 2.0% | |
| R | 2220 | 1.5% | |
| E | 2173 | 1.5% | |
| U | 1794 | 1.2% | |
| H | 1599 | 1.1% | |
| G | 1446 | 1.0% | |
| F | 1208 | 0.8% | |
| K | 1163 | 0.8% | |
| L | 998 | 0.7% | |
| X | 946 | 0.6% | |
| W | 907 | 0.6% | |
| J | 824 | 0.6% | |
| Y | 298 | 0.2% | |
| Other values (15) | 109 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 81479 | 13.1% | |
| a | 78287 | 12.6% | |
| i | 66236 | 10.6% | |
| e | 60138 | 9.7% | |
| o | 50948 | 8.2% | |
| c | 47906 | 7.7% | |
| r | 40213 | 6.5% | |
| d | 25966 | 4.2% | |
| t | 23752 | 3.8% | |
| v | 23597 | 3.8% | |
| s | 21221 | 3.4% | |
| h | 15203 | 2.4% | |
| u | 12215 | 2.0% | |
| f | 11291 | 1.8% | |
| p | 10188 | 1.6% | |
| l | 8589 | 1.4% | |
| m | 8458 | 1.4% | |
| z | 8344 | 1.3% | |
| x | 7787 | 1.3% | |
| k | 7477 | 1.2% | |
| g | 4068 | 0.7% | |
| y | 3649 | 0.6% | |
| b | 2497 | 0.4% | |
| w | 1931 | 0.3% | |
| j | 675 | 0.1% | |
| Other values (76) | 802 | 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ] | 36243 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 49575 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 646 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 8475 | 47.3% | |
| 9 | 8032 | 44.8% | |
| 2 | 689 | 3.8% | |
| 0 | 290 | 1.6% | |
| 4 | 117 | 0.7% | |
| 5 | 88 | 0.5% | |
| 3 | 76 | 0.4% | |
| 7 | 67 | 0.4% | |
| 8 | 50 | 0.3% | |
| 6 | 32 | 0.2% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| ا | 115 | 10.3% | |
| ر | 66 | 5.9% | |
| ل | 52 | 4.7% | |
| र | 43 | 3.9% | |
| و | 37 | 3.3% | |
| ز | 35 | 3.1% | |
| ن | 33 | 3.0% | |
| ج | 32 | 2.9% | |
| ي | 23 | 2.1% | |
| द | 21 | 1.9% | |
| م | 20 | 1.8% | |
| ی | 19 | 1.7% | |
| ว | 16 | 1.4% | |
| ئ | 15 | 1.3% | |
| ک | 13 | 1.2% | |
| ب | 13 | 1.2% | |
| د | 13 | 1.2% | |
| ค | 13 | 1.2% | |
| น | 12 | 1.1% | |
| न | 12 | 1.1% | |
| ब | 11 | 1.0% | |
| 이 | 9 | 0.8% | |
| ت | 8 | 0.7% | |
| ज | 8 | 0.7% | |
| ع | 8 | 0.7% | |
| Other values (230) | 467 | 41.9% |
Most frequent Modifier Letter characters
| Value | Count | Frequency (%) | |
| ー | 123 | 100.0% |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| े | 15 | 15.3% | |
| ् | 12 | 12.2% | |
| ं | 10 | 10.2% | |
| ั | 9 | 9.2% | |
| ี | 9 | 9.2% | |
| ิ | 9 | 9.2% | |
| ่ | 7 | 7.1% | |
| ̇ | 5 | 5.1% | |
| ် | 4 | 4.1% | |
| ု | 3 | 3.1% | |
| ้ | 3 | 3.1% | |
| ိ | 2 | 2.0% | |
| ็ | 2 | 2.0% | |
| ู | 2 | 2.0% | |
| ๊ | 2 | 2.0% | |
| ံ | 1 | 1.0% | |
| ွ | 1 | 1.0% | |
| ू | 1 | 1.0% | |
| ์ | 1 | 1.0% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ी | 12 | 23.1% | |
| ि | 12 | 23.1% | |
| ा | 11 | 21.2% | |
| ो | 9 | 17.3% | |
| ာ | 2 | 3.8% | |
| း | 2 | 3.8% | |
| ை | 1 | 1.9% | |
| ே | 1 | 1.9% | |
| ေ | 1 | 1.9% | |
| ျ | 1 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 772436 | 68.0% | |
| Common | 361957 | 31.9% | |
| Arabic | 585 | 0.1% | |
| Devanagari | 227 | < 0.1% | |
| Thai | 145 | < 0.1% | |
| Cyrillic | 106 | < 0.1% | |
| Han | 104 | < 0.1% | |
| Hangul | 95 | < 0.1% | |
| Greek | 75 | < 0.1% | |
| Katakana | 46 | < 0.1% | |
| Myanmar | 31 | < 0.1% | |
| Hiragana | 20 | < 0.1% | |
| Tamil | 6 | < 0.1% | |
| Inherited | 5 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| ' | 171636 | 47.4% | |
| , | 49575 | 13.7% | |
| 49575 | 13.7% | ||
| [ | 36243 | 10.0% | |
| ] | 36243 | 10.0% | |
| 1 | 8475 | 2.3% | |
| 9 | 8032 | 2.2% | |
| 2 | 689 | 0.2% | |
| _ | 646 | 0.2% | |
| 0 | 290 | 0.1% | |
| ー | 123 | < 0.1% | |
| 4 | 117 | < 0.1% | |
| 5 | 88 | < 0.1% | |
| 3 | 76 | < 0.1% | |
| 7 | 67 | < 0.1% | |
| 8 | 50 | < 0.1% | |
| 6 | 32 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 81479 | 10.5% | |
| a | 78287 | 10.1% | |
| i | 66236 | 8.6% | |
| e | 60138 | 7.8% | |
| o | 50948 | 6.6% | |
| c | 47906 | 6.2% | |
| r | 40213 | 5.2% | |
| d | 25966 | 3.4% | |
| t | 23752 | 3.1% | |
| v | 23597 | 3.1% | |
| C | 21421 | 2.8% | |
| s | 21221 | 2.7% | |
| V | 20446 | 2.6% | |
| h | 15203 | 2.0% | |
| u | 12215 | 1.6% | |
| S | 11469 | 1.5% | |
| f | 11291 | 1.5% | |
| I | 11279 | 1.5% | |
| O | 11270 | 1.5% | |
| M | 10323 | 1.3% | |
| p | 10188 | 1.3% | |
| D | 9377 | 1.2% | |
| P | 8929 | 1.2% | |
| l | 8589 | 1.1% | |
| m | 8458 | 1.1% | |
| Other values (62) | 82235 | 10.6% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 115 | 19.7% | |
| ر | 66 | 11.3% | |
| ل | 52 | 8.9% | |
| و | 37 | 6.3% | |
| ز | 35 | 6.0% | |
| ن | 33 | 5.6% | |
| ج | 32 | 5.5% | |
| ي | 23 | 3.9% | |
| م | 20 | 3.4% | |
| ی | 19 | 3.2% | |
| ئ | 15 | 2.6% | |
| ک | 13 | 2.2% | |
| ب | 13 | 2.2% | |
| د | 13 | 2.2% | |
| ت | 8 | 1.4% | |
| ع | 8 | 1.4% | |
| خ | 7 | 1.2% | |
| ك | 7 | 1.2% | |
| ط | 7 | 1.2% | |
| ے | 6 | 1.0% | |
| س | 5 | 0.9% | |
| ح | 5 | 0.9% | |
| ة | 5 | 0.9% | |
| ش | 4 | 0.7% | |
| چ | 4 | 0.7% | |
| Other values (11) | 33 | 5.6% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| о | 15 | 14.2% | |
| а | 8 | 7.5% | |
| в | 7 | 6.6% | |
| с | 7 | 6.6% | |
| к | 6 | 5.7% | |
| и | 6 | 5.7% | |
| д | 6 | 5.7% | |
| н | 6 | 5.7% | |
| л | 5 | 4.7% | |
| р | 4 | 3.8% | |
| у | 4 | 3.8% | |
| т | 4 | 3.8% | |
| е | 4 | 3.8% | |
| м | 3 | 2.8% | |
| П | 2 | 1.9% | |
| ж | 2 | 1.9% | |
| ч | 2 | 1.9% | |
| б | 2 | 1.9% | |
| і | 1 | 0.9% | |
| С | 1 | 0.9% | |
| Я | 1 | 0.9% | |
| Р | 1 | 0.9% | |
| я | 1 | 0.9% | |
| г | 1 | 0.9% | |
| п | 1 | 0.9% | |
| Other values (6) | 6 | 5.7% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 9 | 9.5% | |
| 소 | 5 | 5.3% | |
| 스 | 5 | 5.3% | |
| 트 | 5 | 5.3% | |
| 해 | 3 | 3.2% | |
| 방 | 3 | 3.2% | |
| 탄 | 3 | 3.2% | |
| 년 | 3 | 3.2% | |
| 단 | 3 | 3.2% | |
| 와 | 3 | 3.2% | |
| 꽃 | 2 | 2.1% | |
| 불 | 2 | 2.1% | |
| 놀 | 2 | 2.1% | |
| 야 | 2 | 2.1% | |
| 달 | 2 | 2.1% | |
| 의 | 2 | 2.1% | |
| 녀 | 2 | 2.1% | |
| 레 | 2 | 2.1% | |
| 키 | 2 | 2.1% | |
| 즈 | 2 | 2.1% | |
| 코 | 1 | 1.1% | |
| 로 | 1 | 1.1% | |
| 나 | 1 | 1.1% | |
| 영 | 1 | 1.1% | |
| 국 | 1 | 1.1% | |
| Other values (28) | 28 | 29.5% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 疫 | 4 | 3.8% | |
| 科 | 4 | 3.8% | |
| 興 | 4 | 3.8% | |
| 彩 | 4 | 3.8% | |
| 苗 | 3 | 2.9% | |
| 新 | 2 | 1.9% | |
| 型 | 2 | 1.9% | |
| 支 | 2 | 1.9% | |
| 募 | 2 | 1.9% | |
| 集 | 2 | 1.9% | |
| 派 | 2 | 1.9% | |
| 六 | 2 | 1.9% | |
| 合 | 2 | 1.9% | |
| 連 | 2 | 1.9% | |
| 豬 | 2 | 1.9% | |
| 藍 | 2 | 1.9% | |
| 絲 | 2 | 1.9% | |
| 相 | 2 | 1.9% | |
| 信 | 2 | 1.9% | |
| 政 | 2 | 1.9% | |
| 府 | 2 | 1.9% | |
| 接 | 2 | 1.9% | |
| 種 | 2 | 1.9% | |
| 死 | 2 | 1.9% | |
| 亡 | 2 | 1.9% | |
| Other values (42) | 45 | 43.3% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ン | 4 | 8.7% | |
| コ | 3 | 6.5% | |
| ナ | 3 | 6.5% | |
| ウ | 3 | 6.5% | |
| イ | 3 | 6.5% | |
| ル | 3 | 6.5% | |
| ク | 3 | 6.5% | |
| ロ | 2 | 4.3% | |
| ス | 2 | 4.3% | |
| ワ | 2 | 4.3% | |
| チ | 2 | 4.3% | |
| ト | 2 | 4.3% | |
| モ | 1 | 2.2% | |
| デ | 1 | 2.2% | |
| レ | 1 | 2.2% | |
| ジ | 1 | 2.2% | |
| ャ | 1 | 2.2% | |
| ゲ | 1 | 2.2% | |
| ム | 1 | 2.2% | |
| ラ | 1 | 2.2% | |
| マ | 1 | 2.2% | |
| ア | 1 | 2.2% | |
| オ | 1 | 2.2% | |
| リ | 1 | 2.2% | |
| ピ | 1 | 2.2% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ว | 16 | 11.0% | |
| ค | 13 | 9.0% | |
| น | 12 | 8.3% | |
| ั | 9 | 6.2% | |
| ี | 9 | 6.2% | |
| ิ | 9 | 6.2% | |
| โ | 7 | 4.8% | |
| ่ | 7 | 4.8% | |
| ด | 6 | 4.1% | |
| ซ | 5 | 3.4% | |
| ม | 4 | 2.8% | |
| า | 4 | 2.8% | |
| จ | 4 | 2.8% | |
| ้ | 3 | 2.1% | |
| อ | 3 | 2.1% | |
| ร | 3 | 2.1% | |
| ก | 3 | 2.1% | |
| เ | 3 | 2.1% | |
| แ | 2 | 1.4% | |
| ็ | 2 | 1.4% | |
| ู | 2 | 1.4% | |
| ไ | 2 | 1.4% | |
| ๊ | 2 | 1.4% | |
| ต | 2 | 1.4% | |
| ล | 2 | 1.4% | |
| Other values (10) | 11 | 7.6% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| र | 43 | 18.9% | |
| द | 21 | 9.3% | |
| े | 15 | 6.6% | |
| न | 12 | 5.3% | |
| ् | 12 | 5.3% | |
| ी | 12 | 5.3% | |
| ि | 12 | 5.3% | |
| ब | 11 | 4.8% | |
| ा | 11 | 4.8% | |
| ं | 10 | 4.4% | |
| ो | 9 | 4.0% | |
| ज | 8 | 3.5% | |
| क | 7 | 3.1% | |
| ए | 6 | 2.6% | |
| औ | 6 | 2.6% | |
| ह | 6 | 2.6% | |
| म | 5 | 2.2% | |
| ग | 5 | 2.2% | |
| व | 5 | 2.2% | |
| स | 5 | 2.2% | |
| त | 2 | 0.9% | |
| य | 1 | 0.4% | |
| प | 1 | 0.4% | |
| ू | 1 | 0.4% | |
| ण | 1 | 0.4% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| த | 1 | 16.7% | |
| ல | 1 | 16.7% | |
| ை | 1 | 16.7% | |
| வ | 1 | 16.7% | |
| ர | 1 | 16.7% | |
| ே | 1 | 16.7% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| ် | 4 | 12.9% | |
| ု | 3 | 9.7% | |
| က | 3 | 9.7% | |
| တ | 2 | 6.5% | |
| ိ | 2 | 6.5% | |
| င | 2 | 6.5% | |
| ာ | 2 | 6.5% | |
| း | 2 | 6.5% | |
| ရ | 1 | 3.2% | |
| န | 1 | 3.2% | |
| ံ | 1 | 3.2% | |
| ဗ | 1 | 3.2% | |
| စ | 1 | 3.2% | |
| ွ | 1 | 3.2% | |
| ယ | 1 | 3.2% | |
| ဆ | 1 | 3.2% | |
| ေ | 1 | 3.2% | |
| မ | 1 | 3.2% | |
| ျ | 1 | 3.2% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| ο | 13 | 17.3% | |
| μ | 9 | 12.0% | |
| ι | 6 | 8.0% | |
| α | 5 | 6.7% | |
| ς | 4 | 5.3% | |
| ε | 4 | 5.3% | |
| β | 4 | 5.3% | |
| λ | 4 | 5.3% | |
| σ | 4 | 5.3% | |
| κ | 2 | 2.7% | |
| ρ | 2 | 2.7% | |
| ν | 2 | 2.7% | |
| Δ | 2 | 2.7% | |
| Μ | 2 | 2.7% | |
| Ε | 2 | 2.7% | |
| υ | 2 | 2.7% | |
| τ | 2 | 2.7% | |
| ω | 1 | 1.3% | |
| Θ | 1 | 1.3% | |
| έ | 1 | 1.3% | |
| Σ | 1 | 1.3% | |
| π | 1 | 1.3% | |
| ί | 1 | 1.3% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ま | 2 | 10.0% | |
| え | 2 | 10.0% | |
| ら | 2 | 10.0% | |
| の | 2 | 10.0% | |
| お | 1 | 5.0% | |
| げ | 1 | 5.0% | |
| て | 1 | 5.0% | |
| け | 1 | 5.0% | |
| た | 1 | 5.0% | |
| に | 1 | 5.0% | |
| は | 1 | 5.0% | |
| し | 1 | 5.0% | |
| い | 1 | 5.0% | |
| こ | 1 | 5.0% | |
| と | 1 | 5.0% | |
| よ | 1 | 5.0% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ̇ | 5 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1133753 | 99.8% | |
| Arabic | 585 | 0.1% | |
| Phonetic Ext | 318 | < 0.1% | |
| Devanagari | 227 | < 0.1% | |
| Katakana | 169 | < 0.1% | |
| Thai | 145 | < 0.1% | |
| IPA Ext | 125 | < 0.1% | |
| Cyrillic | 106 | < 0.1% | |
| CJK | 104 | < 0.1% | |
| Hangul | 95 | < 0.1% | |
| None | 75 | < 0.1% | |
| Latin 1 Sup | 60 | < 0.1% | |
| Myanmar | 31 | < 0.1% | |
| Hiragana | 20 | < 0.1% | |
| Latin Ext A | 14 | < 0.1% | |
| Tamil | 6 | < 0.1% | |
| Diacriticals | 5 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| ' | 171636 | 15.1% | |
| n | 81479 | 7.2% | |
| a | 78287 | 6.9% | |
| i | 66236 | 5.8% | |
| e | 60138 | 5.3% | |
| o | 50948 | 4.5% | |
| , | 49575 | 4.4% | |
| 49575 | 4.4% | ||
| c | 47906 | 4.2% | |
| r | 40213 | 3.5% | |
| [ | 36243 | 3.2% | |
| ] | 36243 | 3.2% | |
| d | 25966 | 2.3% | |
| t | 23752 | 2.1% | |
| v | 23597 | 2.1% | |
| C | 21421 | 1.9% | |
| s | 21221 | 1.9% | |
| V | 20446 | 1.8% | |
| h | 15203 | 1.3% | |
| u | 12215 | 1.1% | |
| S | 11469 | 1.0% | |
| f | 11291 | 1.0% | |
| I | 11279 | 1.0% | |
| O | 11270 | 1.0% | |
| M | 10323 | 0.9% | |
| Other values (43) | 145821 | 12.9% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ا | 115 | 19.7% | |
| ر | 66 | 11.3% | |
| ل | 52 | 8.9% | |
| و | 37 | 6.3% | |
| ز | 35 | 6.0% | |
| ن | 33 | 5.6% | |
| ج | 32 | 5.5% | |
| ي | 23 | 3.9% | |
| م | 20 | 3.4% | |
| ی | 19 | 3.2% | |
| ئ | 15 | 2.6% | |
| ک | 13 | 2.2% | |
| ب | 13 | 2.2% | |
| د | 13 | 2.2% | |
| ت | 8 | 1.4% | |
| ع | 8 | 1.4% | |
| خ | 7 | 1.2% | |
| ك | 7 | 1.2% | |
| ط | 7 | 1.2% | |
| ے | 6 | 1.0% | |
| س | 5 | 0.9% | |
| ح | 5 | 0.9% | |
| ة | 5 | 0.9% | |
| ش | 4 | 0.7% | |
| چ | 4 | 0.7% | |
| Other values (11) | 33 | 5.6% |
Most frequent Katakana characters
| Value | Count | Frequency (%) | |
| ー | 123 | 72.8% | |
| ン | 4 | 2.4% | |
| コ | 3 | 1.8% | |
| ナ | 3 | 1.8% | |
| ウ | 3 | 1.8% | |
| イ | 3 | 1.8% | |
| ル | 3 | 1.8% | |
| ク | 3 | 1.8% | |
| ロ | 2 | 1.2% | |
| ス | 2 | 1.2% | |
| ワ | 2 | 1.2% | |
| チ | 2 | 1.2% | |
| ト | 2 | 1.2% | |
| モ | 1 | 0.6% | |
| デ | 1 | 0.6% | |
| レ | 1 | 0.6% | |
| ジ | 1 | 0.6% | |
| ャ | 1 | 0.6% | |
| ゲ | 1 | 0.6% | |
| ム | 1 | 0.6% | |
| ラ | 1 | 0.6% | |
| マ | 1 | 0.6% | |
| ア | 1 | 0.6% | |
| オ | 1 | 0.6% | |
| リ | 1 | 0.6% | |
| Other values (2) | 2 | 1.2% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| ó | 12 | 20.0% | |
| ü | 8 | 13.3% | |
| é | 8 | 13.3% | |
| á | 8 | 13.3% | |
| í | 6 | 10.0% | |
| ö | 3 | 5.0% | |
| Í | 2 | 3.3% | |
| ñ | 2 | 3.3% | |
| Ç | 2 | 3.3% | |
| ä | 2 | 3.3% | |
| Ó | 1 | 1.7% | |
| è | 1 | 1.7% | |
| ú | 1 | 1.7% | |
| ý | 1 | 1.7% | |
| à | 1 | 1.7% | |
| ï | 1 | 1.7% | |
| ç | 1 | 1.7% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| о | 15 | 14.2% | |
| а | 8 | 7.5% | |
| в | 7 | 6.6% | |
| с | 7 | 6.6% | |
| к | 6 | 5.7% | |
| и | 6 | 5.7% | |
| д | 6 | 5.7% | |
| н | 6 | 5.7% | |
| л | 5 | 4.7% | |
| р | 4 | 3.8% | |
| у | 4 | 3.8% | |
| т | 4 | 3.8% | |
| е | 4 | 3.8% | |
| м | 3 | 2.8% | |
| П | 2 | 1.9% | |
| ж | 2 | 1.9% | |
| ч | 2 | 1.9% | |
| б | 2 | 1.9% | |
| і | 1 | 0.9% | |
| С | 1 | 0.9% | |
| Я | 1 | 0.9% | |
| Р | 1 | 0.9% | |
| я | 1 | 0.9% | |
| г | 1 | 0.9% | |
| п | 1 | 0.9% | |
| Other values (6) | 6 | 5.7% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 이 | 9 | 9.5% | |
| 소 | 5 | 5.3% | |
| 스 | 5 | 5.3% | |
| 트 | 5 | 5.3% | |
| 해 | 3 | 3.2% | |
| 방 | 3 | 3.2% | |
| 탄 | 3 | 3.2% | |
| 년 | 3 | 3.2% | |
| 단 | 3 | 3.2% | |
| 와 | 3 | 3.2% | |
| 꽃 | 2 | 2.1% | |
| 불 | 2 | 2.1% | |
| 놀 | 2 | 2.1% | |
| 야 | 2 | 2.1% | |
| 달 | 2 | 2.1% | |
| 의 | 2 | 2.1% | |
| 녀 | 2 | 2.1% | |
| 레 | 2 | 2.1% | |
| 키 | 2 | 2.1% | |
| 즈 | 2 | 2.1% | |
| 코 | 1 | 1.1% | |
| 로 | 1 | 1.1% | |
| 나 | 1 | 1.1% | |
| 영 | 1 | 1.1% | |
| 국 | 1 | 1.1% | |
| Other values (28) | 28 | 29.5% |
Most frequent Phonetic Ext characters
| Value | Count | Frequency (%) | |
| ᴠ | 87 | 27.4% | |
| ᴇ | 86 | 27.0% | |
| ᴄ | 65 | 20.4% | |
| ᴀ | 64 | 20.1% | |
| ᴛ | 6 | 1.9% | |
| ᴅ | 5 | 1.6% | |
| ᴍ | 3 | 0.9% | |
| ᴏ | 2 | 0.6% |
Most frequent IPA Ext characters
| Value | Count | Frequency (%) | |
| ɪ | 64 | 51.2% | |
| ɴ | 33 | 26.4% | |
| ʟ | 27 | 21.6% | |
| ɢ | 1 | 0.8% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 疫 | 4 | 3.8% | |
| 科 | 4 | 3.8% | |
| 興 | 4 | 3.8% | |
| 彩 | 4 | 3.8% | |
| 苗 | 3 | 2.9% | |
| 新 | 2 | 1.9% | |
| 型 | 2 | 1.9% | |
| 支 | 2 | 1.9% | |
| 募 | 2 | 1.9% | |
| 集 | 2 | 1.9% | |
| 派 | 2 | 1.9% | |
| 六 | 2 | 1.9% | |
| 合 | 2 | 1.9% | |
| 連 | 2 | 1.9% | |
| 豬 | 2 | 1.9% | |
| 藍 | 2 | 1.9% | |
| 絲 | 2 | 1.9% | |
| 相 | 2 | 1.9% | |
| 信 | 2 | 1.9% | |
| 政 | 2 | 1.9% | |
| 府 | 2 | 1.9% | |
| 接 | 2 | 1.9% | |
| 種 | 2 | 1.9% | |
| 死 | 2 | 1.9% | |
| 亡 | 2 | 1.9% | |
| Other values (42) | 45 | 43.3% |
Most frequent Thai characters
| Value | Count | Frequency (%) | |
| ว | 16 | 11.0% | |
| ค | 13 | 9.0% | |
| น | 12 | 8.3% | |
| ั | 9 | 6.2% | |
| ี | 9 | 6.2% | |
| ิ | 9 | 6.2% | |
| โ | 7 | 4.8% | |
| ่ | 7 | 4.8% | |
| ด | 6 | 4.1% | |
| ซ | 5 | 3.4% | |
| ม | 4 | 2.8% | |
| า | 4 | 2.8% | |
| จ | 4 | 2.8% | |
| ้ | 3 | 2.1% | |
| อ | 3 | 2.1% | |
| ร | 3 | 2.1% | |
| ก | 3 | 2.1% | |
| เ | 3 | 2.1% | |
| แ | 2 | 1.4% | |
| ็ | 2 | 1.4% | |
| ู | 2 | 1.4% | |
| ไ | 2 | 1.4% | |
| ๊ | 2 | 1.4% | |
| ต | 2 | 1.4% | |
| ล | 2 | 1.4% | |
| Other values (10) | 11 | 7.6% |
Most frequent Latin Ext A characters
| Value | Count | Frequency (%) | |
| ı | 4 | 28.6% | |
| č | 3 | 21.4% | |
| ş | 2 | 14.3% | |
| İ | 2 | 14.3% | |
| ğ | 2 | 14.3% | |
| ć | 1 | 7.1% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| र | 43 | 18.9% | |
| द | 21 | 9.3% | |
| े | 15 | 6.6% | |
| न | 12 | 5.3% | |
| ् | 12 | 5.3% | |
| ी | 12 | 5.3% | |
| ि | 12 | 5.3% | |
| ब | 11 | 4.8% | |
| ा | 11 | 4.8% | |
| ं | 10 | 4.4% | |
| ो | 9 | 4.0% | |
| ज | 8 | 3.5% | |
| क | 7 | 3.1% | |
| ए | 6 | 2.6% | |
| औ | 6 | 2.6% | |
| ह | 6 | 2.6% | |
| म | 5 | 2.2% | |
| ग | 5 | 2.2% | |
| व | 5 | 2.2% | |
| स | 5 | 2.2% | |
| त | 2 | 0.9% | |
| य | 1 | 0.4% | |
| प | 1 | 0.4% | |
| ू | 1 | 0.4% | |
| ण | 1 | 0.4% |
Most frequent Tamil characters
| Value | Count | Frequency (%) | |
| த | 1 | 16.7% | |
| ல | 1 | 16.7% | |
| ை | 1 | 16.7% | |
| வ | 1 | 16.7% | |
| ர | 1 | 16.7% | |
| ே | 1 | 16.7% |
Most frequent Myanmar characters
| Value | Count | Frequency (%) | |
| ် | 4 | 12.9% | |
| ု | 3 | 9.7% | |
| က | 3 | 9.7% | |
| တ | 2 | 6.5% | |
| ိ | 2 | 6.5% | |
| င | 2 | 6.5% | |
| ာ | 2 | 6.5% | |
| း | 2 | 6.5% | |
| ရ | 1 | 3.2% | |
| န | 1 | 3.2% | |
| ံ | 1 | 3.2% | |
| ဗ | 1 | 3.2% | |
| စ | 1 | 3.2% | |
| ွ | 1 | 3.2% | |
| ယ | 1 | 3.2% | |
| ဆ | 1 | 3.2% | |
| ေ | 1 | 3.2% | |
| မ | 1 | 3.2% | |
| ျ | 1 | 3.2% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| ο | 13 | 17.3% | |
| μ | 9 | 12.0% | |
| ι | 6 | 8.0% | |
| α | 5 | 6.7% | |
| ς | 4 | 5.3% | |
| ε | 4 | 5.3% | |
| β | 4 | 5.3% | |
| λ | 4 | 5.3% | |
| σ | 4 | 5.3% | |
| κ | 2 | 2.7% | |
| ρ | 2 | 2.7% | |
| ν | 2 | 2.7% | |
| Δ | 2 | 2.7% | |
| Μ | 2 | 2.7% | |
| Ε | 2 | 2.7% | |
| υ | 2 | 2.7% | |
| τ | 2 | 2.7% | |
| ω | 1 | 1.3% | |
| Θ | 1 | 1.3% | |
| έ | 1 | 1.3% | |
| Σ | 1 | 1.3% | |
| π | 1 | 1.3% | |
| ί | 1 | 1.3% |
Most frequent Hiragana characters
| Value | Count | Frequency (%) | |
| ま | 2 | 10.0% | |
| え | 2 | 10.0% | |
| ら | 2 | 10.0% | |
| の | 2 | 10.0% | |
| お | 1 | 5.0% | |
| げ | 1 | 5.0% | |
| て | 1 | 5.0% | |
| け | 1 | 5.0% | |
| た | 1 | 5.0% | |
| に | 1 | 5.0% | |
| は | 1 | 5.0% | |
| し | 1 | 5.0% | |
| い | 1 | 5.0% | |
| こ | 1 | 5.0% | |
| と | 1 | 5.0% | |
| よ | 1 | 5.0% |
Most frequent Diacriticals characters
| Value | Count | Frequency (%) | |
| ̇ | 5 | 100.0% |
| Distinct | 171 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 42 |
| Missing (%) | 0.1% |
| Memory size | 360.0 KiB |
| Twitter Web App | |
|---|---|
| Twitter for iPhone | |
| Twitter for Android | |
| TweetDeck | |
| Twitter for iPad | 995 |
| Other values (166) |
| Value | Count | Frequency (%) | |
| Twitter Web App | 14538 | 31.6% | |
| Twitter for iPhone | 13516 | 29.3% | |
| Twitter for Android | 11763 | 25.5% | |
| TweetDeck | 2157 | 4.7% | |
| Twitter for iPad | 995 | 2.2% | |
| 745 | 1.6% | ||
| Hootsuite Inc. | 482 | 1.0% | |
| Buffer | 199 | 0.4% | |
| Twitter Media Studio | 175 | 0.4% | |
| IFTTT | 113 | 0.2% | |
| Etus Brasil | 90 | 0.2% | |
| WordPress.com | 85 | 0.2% | |
| Hocalwire Social Share | 80 | 0.2% | |
| Sprout Social | 79 | 0.2% | |
| Twitter Media Studio - LiveCut | 50 | 0.1% | |
| 48 | 0.1% | ||
| Blog2Social APP | 48 | 0.1% | |
| Twitter for Mac | 42 | 0.1% | |
| Tickeron | 35 | 0.1% | |
| Tweetbot for iΟS | 32 | 0.1% | |
| SocialFlow | 32 | 0.1% | |
| dlvr.it | 31 | 0.1% | |
| IndiaPost | 27 | 0.1% | |
| Smarp. | 26 | 0.1% | |
| Flying Eze | 25 | 0.1% | |
| Other values (146) | 604 | 1.3% | |
| (Missing) | 42 | 0.1% |
Frequencies of value counts
Unique
| Unique | 53 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 32 |
|---|---|
| Median length | 18 |
| Mean length | 16.43685273 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 86984 | 11.5% | |
| 83643 | 11.0% | ||
| r | 81260 | 10.7% | |
| e | 77703 | 10.3% | |
| i | 69530 | 9.2% | |
| o | 54156 | 7.2% | |
| T | 43838 | 5.8% | |
| w | 43551 | 5.8% | |
| p | 29328 | 3.9% | |
| n | 27113 | 3.6% | |
| f | 26858 | 3.5% | |
| A | 26453 | 3.5% | |
| d | 25388 | 3.4% | |
| P | 14830 | 2.0% | |
| b | 14726 | 1.9% | |
| W | 14647 | 1.9% | |
| h | 13738 | 1.8% | |
| a | 3761 | 0.5% | |
| c | 3373 | 0.4% | |
| k | 2344 | 0.3% | |
| D | 2188 | 0.3% | |
| s | 1913 | 0.3% | |
| I | 1439 | 0.2% | |
| u | 1267 | 0.2% | |
| m | 956 | 0.1% | |
| Other values (45) | 6078 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 566042 | 74.8% | |
| Uppercase Letter | 106462 | 14.1% | |
| Space Separator | 83649 | 11.0% | |
| Other Punctuation | 697 | 0.1% | |
| Decimal Number | 89 | < 0.1% | |
| Dash Punctuation | 86 | < 0.1% | |
| Connector Punctuation | 20 | < 0.1% | |
| Open Punctuation | 7 | < 0.1% | |
| Close Punctuation | 7 | < 0.1% | |
| Other Symbol | 6 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 43838 | 41.2% | |
| A | 26453 | 24.8% | |
| P | 14830 | 13.9% | |
| W | 14647 | 13.8% | |
| D | 2188 | 2.1% | |
| I | 1439 | 1.4% | |
| S | 933 | 0.9% | |
| H | 598 | 0.6% | |
| B | 358 | 0.3% | |
| M | 328 | 0.3% | |
| F | 196 | 0.2% | |
| E | 171 | 0.2% | |
| L | 116 | 0.1% | |
| C | 97 | 0.1% | |
| N | 78 | 0.1% | |
| O | 34 | < 0.1% | |
| Ο | 32 | < 0.1% | |
| R | 31 | < 0.1% | |
| V | 21 | < 0.1% | |
| Z | 19 | < 0.1% | |
| G | 18 | < 0.1% | |
| U | 13 | < 0.1% | |
| K | 12 | < 0.1% | |
| X | 7 | < 0.1% | |
| J | 3 | < 0.1% | |
| Other values (2) | 2 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 86984 | 15.4% | |
| r | 81260 | 14.4% | |
| e | 77703 | 13.7% | |
| i | 69530 | 12.3% | |
| o | 54156 | 9.6% | |
| w | 43551 | 7.7% | |
| p | 29328 | 5.2% | |
| n | 27113 | 4.8% | |
| f | 26858 | 4.7% | |
| d | 25388 | 4.5% | |
| b | 14726 | 2.6% | |
| h | 13738 | 2.4% | |
| a | 3761 | 0.7% | |
| c | 3373 | 0.6% | |
| k | 2344 | 0.4% | |
| s | 1913 | 0.3% | |
| u | 1267 | 0.2% | |
| m | 956 | 0.2% | |
| l | 943 | 0.2% | |
| g | 919 | 0.2% | |
| v | 128 | < 0.1% | |
| y | 54 | < 0.1% | |
| z | 27 | < 0.1% | |
| x | 13 | < 0.1% | |
| q | 8 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 83643 | > 99.9% | ||
| 6 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 690 | 99.0% | |
| , | 4 | 0.6% | |
| : | 3 | 0.4% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 69 | 77.5% | |
| 4 | 13 | 14.6% | |
| 1 | 2 | 2.2% | |
| 0 | 2 | 2.2% | |
| 6 | 1 | 1.1% | |
| 7 | 1 | 1.1% | |
| 5 | 1 | 1.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 86 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 20 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 7 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 7 | 100.0% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| 🦉 | 6 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 672472 | 88.8% | |
| Common | 84561 | 11.2% | |
| Greek | 32 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 86984 | 12.9% | |
| r | 81260 | 12.1% | |
| e | 77703 | 11.6% | |
| i | 69530 | 10.3% | |
| o | 54156 | 8.1% | |
| T | 43838 | 6.5% | |
| w | 43551 | 6.5% | |
| p | 29328 | 4.4% | |
| n | 27113 | 4.0% | |
| f | 26858 | 4.0% | |
| A | 26453 | 3.9% | |
| d | 25388 | 3.8% | |
| P | 14830 | 2.2% | |
| b | 14726 | 2.2% | |
| W | 14647 | 2.2% | |
| h | 13738 | 2.0% | |
| a | 3761 | 0.6% | |
| c | 3373 | 0.5% | |
| k | 2344 | 0.3% | |
| D | 2188 | 0.3% | |
| s | 1913 | 0.3% | |
| I | 1439 | 0.2% | |
| u | 1267 | 0.2% | |
| m | 956 | 0.1% | |
| l | 943 | 0.1% | |
| Other values (27) | 4185 | 0.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 83643 | 98.9% | ||
| . | 690 | 0.8% | |
| - | 86 | 0.1% | |
| 2 | 69 | 0.1% | |
| _ | 20 | < 0.1% | |
| 4 | 13 | < 0.1% | |
| ( | 7 | < 0.1% | |
| ) | 7 | < 0.1% | |
| 6 | < 0.1% | ||
| 🦉 | 6 | < 0.1% | |
| , | 4 | < 0.1% | |
| : | 3 | < 0.1% | |
| 1 | 2 | < 0.1% | |
| 0 | 2 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
Most frequent Greek characters
| Value | Count | Frequency (%) | |
| Ο | 32 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 757021 | > 99.9% | |
| None | 38 | < 0.1% | |
| Latin 1 Sup | 6 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 86984 | 11.5% | |
| 83643 | 11.0% | ||
| r | 81260 | 10.7% | |
| e | 77703 | 10.3% | |
| i | 69530 | 9.2% | |
| o | 54156 | 7.2% | |
| T | 43838 | 5.8% | |
| w | 43551 | 5.8% | |
| p | 29328 | 3.9% | |
| n | 27113 | 3.6% | |
| f | 26858 | 3.5% | |
| A | 26453 | 3.5% | |
| d | 25388 | 3.4% | |
| P | 14830 | 2.0% | |
| b | 14726 | 1.9% | |
| W | 14647 | 1.9% | |
| h | 13738 | 1.8% | |
| a | 3761 | 0.5% | |
| c | 3373 | 0.4% | |
| k | 2344 | 0.3% | |
| D | 2188 | 0.3% | |
| s | 1913 | 0.3% | |
| I | 1439 | 0.2% | |
| u | 1267 | 0.2% | |
| m | 956 | 0.1% | |
| Other values (42) | 6034 | 0.8% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| Ο | 32 | 84.2% | |
| 🦉 | 6 | 15.8% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| 6 | 100.0% |
| Distinct | 239 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.084196357 |
|---|---|
| Minimum | 0 |
| Maximum | 6683 |
| Zeros | 30075 |
| Zeros (%) | 65.3% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 7 |
| Maximum | 6683 |
| Range | 6683 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 44.72176481 |
|---|---|
| Coefficient of variation (CV) | 14.50029753 |
| Kurtosis | 11458.64401 |
| Mean | 3.084196357 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 88.08390544 |
| Sum | 142055 |
| Variance | 2000.036247 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 30075 | 65.3% | |
| 1 | 7887 | 17.1% | |
| 2 | 2553 | 5.5% | |
| 3 | 1337 | 2.9% | |
| 4 | 806 | 1.7% | |
| 5 | 478 | 1.0% | |
| 6 | 368 | 0.8% | |
| 7 | 300 | 0.7% | |
| 8 | 245 | 0.5% | |
| 9 | 182 | 0.4% | |
| 10 | 149 | 0.3% | |
| 12 | 142 | 0.3% | |
| 11 | 118 | 0.3% | |
| 13 | 109 | 0.2% | |
| 14 | 90 | 0.2% | |
| 15 | 87 | 0.2% | |
| 16 | 68 | 0.1% | |
| 18 | 51 | 0.1% | |
| 17 | 50 | 0.1% | |
| 19 | 41 | 0.1% | |
| 20 | 37 | 0.1% | |
| 23 | 32 | 0.1% | |
| 24 | 29 | 0.1% | |
| 25 | 29 | 0.1% | |
| 27 | 27 | 0.1% | |
| Other values (214) | 769 | 1.7% |
| Value | Count | Frequency (%) | |
| 0 | 30075 | 65.3% | |
| 1 | 7887 | 17.1% | |
| 2 | 2553 | 5.5% | |
| 3 | 1337 | 2.9% | |
| 4 | 806 | 1.7% | |
| 5 | 478 | 1.0% | |
| 6 | 368 | 0.8% | |
| 7 | 300 | 0.7% | |
| 8 | 245 | 0.5% | |
| 9 | 182 | 0.4% |
| Value | Count | Frequency (%) | |
| 6683 | 1 | < 0.1% | |
| 2360 | 1 | < 0.1% | |
| 2247 | 1 | < 0.1% | |
| 2095 | 1 | < 0.1% | |
| 1980 | 2 | < 0.1% | |
| 1515 | 1 | < 0.1% | |
| 1281 | 1 | < 0.1% | |
| 938 | 1 | < 0.1% | |
| 922 | 1 | < 0.1% | |
| 870 | 1 | < 0.1% |
| Distinct | 503 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.52170043 |
|---|---|
| Minimum | 0 |
| Maximum | 22815 |
| Zeros | 19255 |
| Zeros (%) | 41.8% |
| Memory size | 360.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 31 |
| Maximum | 22815 |
| Range | 22815 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 191.984916 |
|---|---|
| Coefficient of variation (CV) | 14.19828202 |
| Kurtosis | 6283.978438 |
| Mean | 13.52170043 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 66.48283436 |
| Sum | 622796 |
| Variance | 36858.20798 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 19255 | 41.8% | |
| 1 | 8478 | 18.4% | |
| 2 | 4196 | 9.1% | |
| 3 | 2503 | 5.4% | |
| 4 | 1679 | 3.6% | |
| 5 | 1189 | 2.6% | |
| 6 | 974 | 2.1% | |
| 7 | 722 | 1.6% | |
| 8 | 593 | 1.3% | |
| 9 | 469 | 1.0% | |
| 11 | 396 | 0.9% | |
| 10 | 376 | 0.8% | |
| 12 | 323 | 0.7% | |
| 13 | 277 | 0.6% | |
| 14 | 241 | 0.5% | |
| 15 | 224 | 0.5% | |
| 16 | 224 | 0.5% | |
| 17 | 187 | 0.4% | |
| 18 | 169 | 0.4% | |
| 20 | 148 | 0.3% | |
| 21 | 135 | 0.3% | |
| 19 | 133 | 0.3% | |
| 23 | 117 | 0.3% | |
| 22 | 108 | 0.2% | |
| 26 | 98 | 0.2% | |
| Other values (478) | 2845 | 6.2% |
| Value | Count | Frequency (%) | |
| 0 | 19255 | 41.8% | |
| 1 | 8478 | 18.4% | |
| 2 | 4196 | 9.1% | |
| 3 | 2503 | 5.4% | |
| 4 | 1679 | 3.6% | |
| 5 | 1189 | 2.6% | |
| 6 | 974 | 2.1% | |
| 7 | 722 | 1.6% | |
| 8 | 593 | 1.3% | |
| 9 | 469 | 1.0% |
| Value | Count | Frequency (%) | |
| 22815 | 1 | < 0.1% | |
| 17432 | 1 | < 0.1% | |
| 9458 | 1 | < 0.1% | |
| 8470 | 1 | < 0.1% | |
| 8153 | 1 | < 0.1% | |
| 8098 | 1 | < 0.1% | |
| 6651 | 1 | < 0.1% | |
| 6163 | 1 | < 0.1% | |
| 5827 | 1 | < 0.1% | |
| 5575 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| id | user_name | user_location | user_description | user_created | user_followers | user_friends | user_favourites | user_verified | date | text | hashtags | source | retweets | favorites | is_retweet | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1340539111971516416 | Rachel Roh | La Crescenta-Montrose, CA | Aggregator of Asian American news; scanning diverse sources 24/7/365. RT's, Follows and 'Likes' will fuel me 👩💻 | 2009-04-08 17:52:46 | 405 | 1692 | 3247 | False | 2020-12-20 06:06:44 | Same folks said daikon paste could treat a cytokine storm #PfizerBioNTech https://t.co/xeHhIMg1kF | ['PfizerBioNTech'] | Twitter for Android | 0 | 0 | False |
| 1 | 1338158543359250433 | Albert Fong | San Francisco, CA | Marketing dude, tech geek, heavy metal & '80s music junkie. Fascinated by meteorology and all things in the cloud. Opinions are my own. | 2009-09-21 15:27:30 | 834 | 666 | 178 | False | 2020-12-13 16:27:13 | While the world has been on the wrong side of history this year, hopefully, the biggest vaccination effort we've ev… https://t.co/dlCHrZjkhm | NaN | Twitter Web App | 1 | 1 | False |
| 2 | 1337858199140118533 | eli🇱🇹🇪🇺👌 | Your Bed | heil, hydra 🖐☺ | 2020-06-25 23:30:28 | 10 | 88 | 155 | False | 2020-12-12 20:33:45 | #coronavirus #SputnikV #AstraZeneca #PfizerBioNTech #Moderna #Covid_19 Russian vaccine is created to last 2-4 years… https://t.co/ieYlCKBr8P | ['coronavirus', 'SputnikV', 'AstraZeneca', 'PfizerBioNTech', 'Moderna', 'Covid_19'] | Twitter for Android | 0 | 0 | False |
| 3 | 1337855739918835717 | Charles Adler | Vancouver, BC - Canada | Hosting "CharlesAdlerTonight" Global News Radio Network. Weeknights 7 Pacific-10 Eastern - Email comments/ideas to charles@charlesadlertonight.ca | 2008-09-10 11:28:53 | 49165 | 3933 | 21853 | True | 2020-12-12 20:23:59 | Facts are immutable, Senator, even when you're not ethically sturdy enough to acknowledge them. (1) You were born i… https://t.co/jqgV18kch4 | NaN | Twitter Web App | 446 | 2129 | False |
| 4 | 1337854064604966912 | Citizen News Channel | NaN | Citizen News Channel bringing you an alternative news source from citizen journalists that haven't sold out. Real news & real views | 2020-04-23 17:58:42 | 152 | 580 | 1473 | False | 2020-12-12 20:17:19 | Explain to me again why we need a vaccine @BorisJohnson @MattHancock #whereareallthesickpeople #PfizerBioNTech… https://t.co/KxbSRoBEHq | ['whereareallthesickpeople', 'PfizerBioNTech'] | Twitter for iPhone | 0 | 0 | False |
| 5 | 1337852648389832708 | Dee | Birmingham, England | Gastroenterology trainee, Clinical Research Fellow in IBD, mother to human and fur baby, Canadian in Britain | 2020-01-26 21:43:12 | 105 | 108 | 106 | False | 2020-12-12 20:11:42 | Does anyone have any useful advice/guidance for whether the COVID vaccine is safe whilst breastfeeding?… https://t.co/EifsyQoeKN | NaN | Twitter for iPhone | 0 | 0 | False |
| 6 | 1337851215875608579 | Gunther Fehlinger | Austria, Ukraine and Kosovo | End North Stream 2 now - the pipeline of corruption, funding Russias war against Ukraine,Georgia, Syria and political intervention in USA and EU must be stopped | 2013-06-10 17:49:22 | 2731 | 5001 | 69344 | False | 2020-12-12 20:06:00 | it is a bit sad to claim the fame for success of #vaccination on patriotic competition between USA, Canada, UK and… https://t.co/IfMrAyGyTP | ['vaccination'] | Twitter Web App | 0 | 4 | False |
| 7 | 1337850832256176136 | Dr.Krutika Kuppalli | NaN | ID, Global Health, VHF, Pandemic Prep, Emerging Infections, & Health Policy MD| U.S. Congress COVID-19 expert witness x 2 | ELBI 2020 @JHSPH_CHS | 2019-03-25 04:14:29 | 21924 | 593 | 7815 | True | 2020-12-12 20:04:29 | There have not been many bright days in 2020 but here are some of the best \n1. #BidenHarris winning #Election2020… https://t.co/77u4f8XXfx | ['BidenHarris', 'Election2020'] | Twitter for iPhone | 2 | 22 | False |
| 8 | 1337850023531347969 | Erin Despas | NaN | Designing&selling on Teespring. Like 90s Disney tv movies, old school WWE. Dislikes Intolerance, hate, bigots and snakes https://t.co/fa5n4gEHgR | 2009-10-30 17:53:54 | 887 | 1515 | 9639 | False | 2020-12-12 20:01:16 | Covid vaccine; You getting it?\n\n #CovidVaccine #covid19 #PfizerBioNTech #Moderna | ['CovidVaccine', 'covid19', 'PfizerBioNTech', 'Moderna'] | Twitter Web App | 2 | 1 | False |
| 9 | 1337842295857623042 | Ch.Amjad Ali | Islamabad | #ProudPakistani #LovePakArmy #PMIK @insafianspower1\n#PoliticalScience #InternationalAffairs \n#PAKUSTV #Newyork #Islamabad | 2012-11-12 04:18:12 | 671 | 2368 | 20469 | False | 2020-12-12 19:30:33 | #CovidVaccine \n\nStates will start getting #COVID19Vaccine Monday, #US says \n#pakustv #NYC #Healthcare #GlobalGoals… https://t.co/MksOvBvs5w | ['CovidVaccine', 'COVID19Vaccine', 'US', 'pakustv', 'NYC', 'Healthcare', 'GlobalGoals'] | Twitter Web App | 0 | 0 | False |
Last rows
| id | user_name | user_location | user_description | user_created | user_followers | user_friends | user_favourites | user_verified | date | text | hashtags | source | retweets | favorites | is_retweet | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 46049 | 1376099384056766465 | mmnjug™ | Kenya | From where I sit: There is a certain majesty in simplicity. | 2009-04-08 07:48:10 | 42248 | 1792 | 2658 | False | 2021-03-28 09:10:34 | For #SputnikV, its storage temperature is -18°C Its two doses are given 21 days apart, unlike the eight weeks apart… https://t.co/1LkJT4lGPR | ['SputnikV'] | Twitter for Android | 0 | 0 | False |
| 46050 | 1376099072474546176 | mmnjug™ | Kenya | From where I sit: There is a certain majesty in simplicity. | 2009-04-08 07:48:10 | 42248 | 1792 | 2658 | False | 2021-03-28 09:09:20 | #SputnikV coronavirus vaccine offers around 92% protection against Covid-19, according to late-stage trial results published in The Lancet. | ['SputnikV'] | Twitter for Android | 0 | 0 | False |
| 46051 | 1376096823102865412 | Passionate Panafricanist | NaN | The greatest war, the #Negro; Africa faces is the #ideological WAR. When WE OVERCOME IT, we WILL CONQUER THE #WORLD. | 2021-01-12 13:42:33 | 129 | 849 | 2505 | False | 2021-03-28 09:00:23 | @Consumers_Kenya @MOH_Kenya #SputnikV has good ratings globally and FRANKLY MAYBE ITS TIME TO TRUST WORKING WITH… https://t.co/fs4F23CmYr | ['SputnikV'] | Twitter for Android | 0 | 0 | False |
| 46052 | 1376094143399796736 | mmnjug™ | Kenya | From where I sit: There is a certain majesty in simplicity. | 2009-04-08 07:48:10 | 42248 | 1792 | 2658 | False | 2021-03-28 08:49:45 | Questions have emerged on who, between the Pharmacy & Poisons Board and its parent @MOH_Kenya, is telling the truth… https://t.co/hw2xSPjFju | NaN | Twitter for Android | 6 | 4 | False |
| 46053 | 1376084087312637963 | Douglas Herbert | Paris | Paris-based commentator at @France24. Also an avid tweeter of historical photos and artworks that catch my fancy. Instagram: @dougherbertf24 | 2009-02-27 17:18:30 | 9265 | 2413 | 1529 | True | 2021-03-28 08:09:47 | Universal access: Some shopping malls in the Urals capital city of #Yekaterinburg are offering the #SputnikV vaccin… https://t.co/z9zgK4lkG1 | ['Yekaterinburg', 'SputnikV'] | Twitter for iPhone | 0 | 6 | False |
| 46054 | 1376080077054746624 | Consumer Grassroots | Kenya | Official Account for Consumer Grassroots Association (CGA). We Empower Consumers to Protect Themselves. Stay informed to stay safe. Updates https://t.co/AflCO8VWrU | 2016-11-30 08:14:19 | 23021 | 447 | 19882 | True | 2021-03-28 07:53:51 | Russian Covid-19 vaccine #SputnikV now in Kenya. On 25th March 2021, @MOH_Kenya warned Kenyans against taking the v… https://t.co/Lgu8YSOrjJ | ['SputnikV'] | Twitter Web App | 1 | 2 | False |
| 46055 | 1376073682381107201 | Michael Muchiri | Nairobi | Civil Engineer working in Kenya. With a Passion for development and maintenance of infrastructure. | 2011-01-31 17:39:38 | 1595 | 3515 | 6795 | False | 2021-03-28 07:28:26 | Communique on COVID19 Lockdown Effects on USIU University Programmes this Semester.\n@ExperienceUSIU @USIUAlumni… https://t.co/qr3eiQ8xgr | NaN | Twitter for Android | 0 | 0 | False |
| 46056 | 1376068500360470529 | Michael Muchiri | Nairobi | Civil Engineer working in Kenya. With a Passion for development and maintenance of infrastructure. | 2011-01-31 17:39:38 | 1595 | 3515 | 6795 | False | 2021-03-28 07:07:51 | Mask is worn on the face, for protection, disguise, performance, or entertainment; ceremonial & practical purposes,… https://t.co/UPP1isIFLx | NaN | Twitter for Android | 0 | 0 | False |
| 46057 | 1376058766454624261 | Stankevicius International | Dublin, Ireland | Professional trading consultant specializing in contracting and due diligence with a strong presence and network in international markets. | 2020-06-30 12:31:42 | 16 | 3 | 0 | False | 2021-03-28 06:29:10 | Selling: #NitrileGloves, #1860 #FaceMasks, #Vaccines #SputnikV, #syringes. Contact sales: https://t.co/gWmRopLARO o… https://t.co/iRFg3e9lRc | ['NitrileGloves', 'FaceMasks', 'Vaccines', 'SputnikV', 'syringes'] | IFTTT | 0 | 1 | False |
| 46058 | 1376057426793861122 | Firras Jabar | Iraq | NaN | 2018-10-29 13:16:05 | 45 | 205 | 4492 | False | 2021-03-28 06:23:51 | #Novartis. #Pfizer #vaccine #VaccinePassports #Sinopharm #astrazenecavaccine #SputnikV #COVID19 @save_children… https://t.co/pcLE5yU6IF | ['Novartis', 'Pfizer', 'vaccine', 'VaccinePassports', 'Sinopharm', 'astrazenecavaccine', 'SputnikV', 'COVID19'] | Twitter for Android | 0 | 0 | False |